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Brief Reports 


The Journal of Consulting Psychology will 
accept Brief Reports of research studies in 
clinical psychology for early publication with- 
out expense to the author. The procedure is 
intended to permit the publication of soundly 
designed studies of specialized interest or lim- 
ited importance which cannot now be ac- 
cepted because of lack of space. Several pages 
in each issue will be devoted to Brief Reports, 
published in the order of their receipt with- 
out respect to the dates of receipt of the regu- 
lar articles. Most Brief Reports appear in the 
first or second issue to go to press following 
their final acceptance. 


An author who wishes to submit a Brief 
Report: 

1. Sends the Brief Report, limited to one printed 
page and prepared according to the specification 
given below. 

2. Also sends to the Editor a full report of the re- 
search study, in sufficient detail to give a clear ac- 
count of its background, procedure, results, and con- 
clusions, which will be filed with the American 
Documentation Institute to insure indefinite avail- 
ability. 

3. Prepares at least 100 mimeographed copies of 
the full report, which the author will send without 
charge to all who request it as long as the supply 
lasts. 


4. Agrees not to submit the full report to another 
journal of general circulation 


Specifications 


Brief Report. The Brief Report should give 
a clear, condensed summary of the procedure 
of the study and as full an account of the re- 
sults as space permits. 

To insure that the Brief Report will be no 
longer than one printed page, its typescript, 
including all matter except the title and the 


author’s lines, must not exceed 85 lines av- 
eraging 42 characters and spaces in length. 
Set the typewriter margins for short lines of 
42 characters, which are 3.5 inches long in 
elite typing, and 4.2 inches long in pica. 

The manuscript of the Brief Report must 
be double spaced throughout. Except for its 
short lines, it follows the standard style of 
the 1957 revision of the APA Publication 
Manual. Headings, tables. and references are 
avoided or, if essential, must be counted in 
the 85 lines. Each Brief Report must be ac- 
companied by a footnote ir: the style below, 
which is typed on a separate sheet and not 
counted in the 85-line quota: 


1An extended report of this study may be ob- 
tained without charge from John Doe (giving the 
author’s full name and address) or for a fee from 
the American Documentation Institute. Order Docu- 
ment No. from ADI Auxiliary Publications 
Project, Photoduplication Service, Library of Con- 
gress; Washington 25, D. C., remitting in advance 
$— for microfilm or $ for photocopies. Make 
checks payable to: Chief, Photoduplication Service, 
Library of Congress. 


Extended report. Because the extended re- 
port is intended for photoduplication, and is 
not copy to be sent to a printer, its style 
should differ in several ways from that of 
other manuscripts: (a) The extended report 
should be typed with single spacing for 
economy in duplication. (6) Tables and fig- 
ures should be placed adjacent to the text 
which refers io them. A caption should be 
typed below each figure. (c) Footnotes should 
be typed at the bottom of the page on which 
reference is made to them. In other respects, 
the full report is prepared in the style speci- 
fied by the Publication Manual. 





THE ROLE OF THE INTERNSHIP IN THE RESEARCH 
TRAINING OF THE CLINICAL PSYCHOLOGIST 


JULES D 


HOLZBERG! 


nec ut State Ho pital 


Training in 
j 


clinical psychology has con- 


sisted of two aspects a minimum of 3 

university where student 
generic basic psy- 
chology, training in theory and techniques 
relevant to clinical psychology, and practicum 


experience in research; (4) at least 1 year of 


years at a the 


receives a education in 


practicum experience (internship) in a clini- 
il center to provide substance for the more 

‘mic aspects of the student’s clinical 
training. Within this organization of clinical 
training, the university has been viewed as 
basically responsible for the training of the 
clinician as a psychological researcher, while 
the internship is 


generally conceived of pro- 
viding only minimal support for the research 


spects of a clinical student’s training 
However, one n iy accept this 


ponsibility of the university 


basic 
while at 
ne time 


questioning the adequacy of a 


nception that denies to the internship a 
I t 
signihcant 


+ 


responsibility in the training of 


ve clinician as a resear( The university 
unquestionably better 


yoint of 


equipped from the 
view of facilities and personnel to 
provide training in the definition and applica- 
tion of psychological concepts, in the use of 

aH 


library as a research tool, in experimental 


research design, in tl utilization of 
methods: of sampling 


merous other requisite 


tical analyses, 
itions, and the 1 

petent reseat 

internship center has a sig- 

ificant role to play in the research training 
yf the clinical student in at least two signifi- 


to sensitize the student to 


ner 


However, the 


respects > (@) 
The author is to Solomon E 
manuscript and helped make it a 
document. However, the author 
assumes full responsibility for the ideas expressed. 


indebted Feldman, 
ho read the 


more readable 


the meaningful research problems in the 
clinical field and to demonstrate how clinical 
skills and techniques may be adapted to re- 
search goals, while providing a framework 
that research on clinical problems 
without destroying their clinical significance; 
and (b) to contribute to the building of a 
that effectively integrates the 
“researcher” and “clinician.” The 
first of these aspects is by no means insig- 
nificant, for the university may, because of 
its frequent insistence on the 
“researchability” of a problem, 
clinical student into research 
the problem is “neat,” 


fosters 


self-concept 
role of 


absolute 

force the 
where 
but the clinical sig- 
This is not meant as 
since the 
function of the 
university is to provide generic training in the 
and there may 

focusing this training on 
problems. We must, 
be prepared tor one possible consequence of 
this training, i.e., this criterion of research- 
ability may become so rigidly fixed that the 
clinician may never undertake research on a 
clinical rare indeed 
that a clinical problem can hope to attain the 
level of researchability that meets the stand- 


in the 


area 


nificance quite meager. 
a criticism of 
author 


university training, 
recognizes that the 


development of research skills, 


be real merit in 


hah] —— 
researchable however, 


problem, because it is 


ard set for students typical university 
The second research 
the internship is related to the author’s ob- 
that clinical interns frequently 
come to the internship with a strong sense 
of inferiority as a researcher. Whether this 
is a function of the preselection of clinical 
students (in that students entering into. the 
clinical area may do so because of intellectual 
and personality reasons that are incompatible 
with a research role) or whether this is a 


training function of 


servations 


185 





186 Jules D 
function of the absorption of attitudes 
toward the clinician as a research worker 
that unfortunately permeate many of our 
universities, the result often seems to be 
that when the clinical student enters into 
the internship, he does so with a self-concept 
that is relatively weak with regard to the 
integration of research as a practicing role. 

The internship frequently does not help to 
counter such feelings of inferiority. It may 
fail to provide a “culture” which rewards 
the development of a research role for the 
clinician by encouraging other role behaviors, 
such as that of diagnostician and therapist. 
The practicing clinician in the internship 
center may further contribute to this self- 
concept by failing to provide an adequate 
model of the clinician as a research worker. 
He is usually not involved in research and 
may actually give voice to antiresearch atti- 
tudes, so that we have the vicious “cultural” 
cycle of the clinician, having learned to reject 
the role of research for himself, now fostering 
the same attitudes in the students sent to 
him for training. 


THE ACQUISITION OF A RESEARCH ROLE” 


I have been using two concepts which 
should be more precisely defined. One is that 
of “role” of a researcher, i.e., those actions 
which are performed or not performed to 
confirm the occupancy of the research role. 
The second, “self-concept,” may be defined 
as the values used by a person to describe 
and evaluate himself, and here I am specifi- 
cally referring to self-descriptions of the 
clinician with regard to values pertinent to 
being a research worker. These two concepts 
are functionally quite interrelated since atti- 
tudes toward the self affect the kind of role 
that will and can be played, and conversely, 
the success with which one plays or does not 
play a particular role will in turn affect the 
self-concept. 

Two methods of role learning may be 
delineated, i.e., intentional learning and in- 
cidental learning. Intentional learning is 
explicit learning and the intentional learning 
of a research role occurs primarily in the 

2A number of the ideas presented in this paper 


were stimulated by Sarbin’s (1954) excellent dis- 
cussion of role theory. 


. Holzberg 


university. In incidental learning, the indi- 
vidual adopts “the prevailing pattern.” It 
is the author’s thesis that incidental learning, 
much of which takes place in the clinical 
center, plays a significant part in the ultimate 
development of the self-concept of the 
clinician. Incidental learning occurs essen- 
tially through a process of identification 
(unconscious) or imitation (conscious). This 
places great emphasis on those in the 
emerging clinical psychologist’s environment 
who have securely integrated the role of 
researcher into their professional self-concept. 
It is here that an important defect exists in 
the opportunities for learning a research role 
since the figures who securely integrate the 
research role into their self-concepts are fre- 
quently unavailable or minimally available 
in the internship. For the most part, there 
are figures who are diagnosticians, therapists, 
administrators, etc. The intern can identify 
with figures that are not in his immediate 
visual field, such as significant researchers in 
the field of psychology. However, we have 
perhaps failed to emphasize the fact that 
imitation, which may be more important in 
the development of the professional self- 
concept, does require the figure to be in the 
visual field. 

This has been implicitly recognized in 
many clinical centers with the result that 
more clinical centers are today resorting to 
the use of research consultants who serve the 
important role of providing figures with a 
secure self-concept that incorporates the role 
of researcher. These are, for the most part, 
research psychologists from universities who 
visit the internship center at varying inter- 
vals, spending several hours on each visit and 
advising on theoretical conceptualizations, 
research designing, statistical analyses, and 
other issues pertinent to research. But even 
the use of consultants may be insufficient. 
Frequently, these are fleeting figures who 
appear irregularly, who may not actually be- 
come involved in research within the clinical 
center, and who are often not clinicians but 
are the prototypes of those academic figures 
who have originally contributed to feelings 
of inferiority about the clinician’s research 
role. 
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This may be a considerable oversimplifica- 
tion. One may still have figures available who 
provide appropriate models for identification 
or imitation, but there may still not be a 
basic motivation for learning a research role. 
From where does the motivation to learn and 
practice the role of researcher stem? Such 
motivation may derive from factors such as 
financial rewards, status, advancement, and 
other extrinsic stimulants. However, we can- 
not ignore the fact that there may be more 
intrinsic factors operating to provide motiva- 
tion for the learning of a research role. 
Redlich and Brody (1955) have emphasized 
research motivation as being derived from 
“powerful unconscious drives to gratify infan- 
tile curiosity and the wish for omniscience” 
(p. 234). Kubie (1953) has carried this 
further and has attempted to relate uncon- 
scious strivings to the various aspects of 
research such as the problem selected for 
study, the specific research techniques uti- 
lized, the hypothesis selected for verification, 
and other aspects of the research process. 

In addition to learning the skills of re- 
search, including clinical research skills, and 
in addition to learning the role of researcher 
and effectively integrating it into one’s self- 
concept, there is another 


type of learning 


that is a joint responsibility of the university 
and the clinical center. This is the learning 
of the ethics of research. I refer here to the 
obligations of the researcher to his subjects, 
to his profession, to the organization which 
employs him, to his collaborators, etc. Too 


often, one observes the researcher, be he 
clinical or not, whose competence in terms 
of knowledge and skills is obviously of a 
high order, but who has failed to integrate 
a valid system of ethical values into his 


research activity. 


PROBLEMS OF ROLE ENACTMENT 


One of the complex adjustment problems 
faced by the young clinician is the need to 
learn a multiplicity of roles, e.g., diagnos- 
tician, therapist, teacher, researchist, etc. At 
times, there may be conflict between certain 
of these roles, which may then result in the 
relinquishing of one of the roles in order to 


resolve the conflict. Where one of the roles 
is that of research, this frequently leads to 
the relinquishing of the research role because 
the clinical center frequently supports this 
role less than others. 

Role conflict occurs if the individual 
occupies two positions simultaneously, and 
when the role expectations of one position 
are incompatible with the role expectations 
of the other. It is rare indeed that situations 
arise which present this type of role conflict 
as it pertains to the practice of research. 
Where one finds the verbalization of such 
role conflict, one must suspect that there 
exists another explanation, other than a real 
role conflict, since it is rare that the indi- 
vidual is expected to perform a research role 
and any other role simultaneously. The role 
of clinician and the role of research worker, 
even where these are successfully integrated, 
are not performed simultaneously, but are 
performed at different times, in different 
settings, in relation to different individuals, 
etc. The author views the multiple roles of 
the clinical psychologist not as mutually in- 
compatible, but mutually complementary. 
The ability to successfully perform a multi- 
plicity of professional roles permits one to 
enrich the particular role that the psycholo- 
gist may be enacting at a particular time. 

It has frequently been stressed that the 
attitudes and skills required for research are 
significantly different from those required for 
clinical activity. The problem here is not that 
there is a basic incompatability between 
research and clinical activity, but that certain 
types of research, usually of the experimental 
laboratory type, are strikingly different from 
what is involved in clinical practice. How- 
ever, there are many problems which cannot 
at the moment be studied effectively in a 
strictly experimental laboratory manner, and 
here the clinical method, with its latitude, 
with its less standardized approach, with its 
greater subjectivity, can be marshaled to 
fulfill a research objective. The problem is 
one of adapting the clinical method so that 
it can become a useful research method. In 
fact, the author has stressed elsewhere the 
fundamental resemblance of the clinical and 
the scientific methods (Holzberg, 1957). 
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iot being it the clinical 
gist must enact all of his roles with 
me degree involve- 
lividual who can main- 
and intense 
in all of the 
expected to perform. 
have the highest 
in 
must inte- 
availability of 

h role. Clinical psychologists vary 


. 4a 
suggested that 


of intensity or of 

It is the rare in 
intense affective involvement 
expenditul 
roles he may 
The 
involvement in 
the repertoire of 
grated 


physic il effort 
be 
needa not 
h, but 


there 


linician 


researc somewhere 


roles be 
into the clinician the 
the resear< l 
in thei: of the role such 
that located continuum 
extending from almost automatically enacting 
the the 
own 


enactment tl research 


each may be on a 


role, to the other extreme where 


clinician is very self-conscious of his 


role enactment 
the 


The problem is how to make 
clinician, and the clinical 


the 


particularly 


student, become les self-conscious of 


enactment of the research role. 


To perform a role, the individual must 


clearly know the expect 
whom 


those with 


itions oO! 


he interacts in this role. Unless the 


internship staff concept 


as incorporating resear 


ment 
TT@ it 


of the research 


problems. Thus, he may act out a role which 


may consider i not 


ippropriate, if 
proj 


bizarre. Of crucial importance here 


overt acts of ther than 


verbalizations. If others are doing research, 


if research is being actively si rted in the 


agency, this is a far mo int factor 


. 1 
; 


the attituc 


mere verbalized 


vealing es earch 
rol han 

Where the professional structure demands 
1 multiplicity of role performances as in 


clinical there must “‘flexi- 
that he 


one role 0 
this is 


psychology, 
bility” in 
ily 


another. It 


individual, 
shift 
is apparent that 
of self that 
difficulty in the enactment of the 


the 


able 


so 


rea be to from 
one 
dimension the may make for 
research 
role, simply because the individual may be 
lacking in this capacity for flexibility. At the 
same time, the individual must possess suf- 
ficient capacity to erect barriers 
particular role being enacted, so that the role 


does not shift inappropriately. 


iround a 


DEFENSIVE ro RESEARCH ROLE 


If the nd qualities of a given 
role ar with the self-concept 
maintained by the individual, 
the probability that the 


this enhances 

individual will be 

tent with the 

1e thesis of the writer 
+h ot 

» clinician 


consis 


able to | 
role expect 
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searchist with 
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are not 
and resort to 
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exemplified by the resort the 
that research 
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of others. This may ce the form 
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f in destructive 
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another 
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acceptable stat 
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they 
research 
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never get inv h. Fre- 


role as 
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regarding 
research 


certain « ompeten¢ ies 


have 


which should lead to ac- 





Internship in Research Training 


sense of inferiority and the potential feeling 
that their research activity will not succeed 


that one finds this tragic discrepancy between 
genuine ability in research and an inability 
to function in a research role. At the other 
extreme is the “dilettante” who frequently 
does have some research interest and talent, 
but who wants to research every problem here 
and now. This frequently masks the in- 
security and inferiority of the psychologist 
who is anxious that his research effort yield 
a significant contribution, and so to buttress 
his own security seeks to embark on a second 
research project simultaneously. But, even 
this does not assuage his anxiety so that he 
must turn now to a third project, hoping that 
in some way a multiplicity of research ac- 
tivities will at least yield one research project 
which will earn him success. This multiplicity 
of efforts frequently leads to activity but no 
sustained research. 


THe CLINICIAN AS A RESEARCHER 


A question that frequently is posed is 
whether all clinical psychologists should be 
expected to do research. Certainly, at the 
very least, we should expect that the indi- 
vidual who has received a PhD in clinical 
psychology approack his clinical tasks with 
an orientation geared to recognize problems 
that remain unanswered and questions seek- 
ing solutions. Perhaps this is as much as we 
should expect from the bulk of practicing 
clinical psychologists—this capacity to recog- 
nize and be aware of the problems needing 
solution. Thus, an intern may conceivably 
have a successful experience with regard to 
research if he can spend his year within a 
setting that, while it does not practice re- 
search, at least is self-questioning and self- 
analyzing with regard to its beliefs and its 
techniques. However, it is the thesis of the 
author that the more significant research 
experience during the internship will be ac- 
complished where the setting is itself one 
where the psychologists are actively involved 
in research activity. 

The controversy as to whether we should 
be training people to do research or to 
evaluate published research still continues. 
Even if we could agree that this was a 
legitimate conflict, in what way would the 
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training for doing research be different from 
the training that would enable one to evaluate 
research? Is it possible to evaluate research 
without systematic involvement in research 
that brings home to one realistically the im- 
portant issues with which one is concerned in 
research activity? 

It is important to stress that there are 
many skills that enter into being a well- 
rounded research person. Among these are: 
the capacity to recognize meaningful research 
problems; the capacity to engage in the 
creative thinking involved in building hy- 
potheses and deducing implications from 
them; the capacity to integrate and utilize 
theory to refine a problem; the capacity to 
develop a research plan with appropriate 
controls that would make possible the testing 
of the hypotheses; the capacity to assess the 
nature of populations and the types of 
samplings to be made; the capacity to select 
the appropriate techniques and instruments 
to be used in the measuring process of the 
research; the capacity to deal with subjects 
not as clinical but as research entities; the 
capacity to record faithfully responses made 
by subjects; the capacity to tabulate, score, 
and analyze the data resulting from the 
research; and the capacity to communicate, 
both in written and oral form, the nature 
of the investigation and the results. Clearly, 
there are few people who possess all of these 
skills to a sufficient degree to qualify as the 
well-rounded research clinician. Here indi- 
vidual differences must be recognized and 
accepted. Individuals vary quantitatively in 
the extent to which they possess the various 
skills that have been enumerated, and a 
place in research exists for all people trained 
as clinical psychologists, each one making a 
contribution commensurate with his capaci- 
ties and within his limitations. Group re- 
search is perhaps a prime way in permitting 
every clinical psychologist a research role. 


RESEARCH TRAINING FOR THE INTERN 


It seems imperative that the setting in 
which the intern receives his first major 
clinical experience be one in which research 
in all of its aspects is a vibrant part of the 
program. This is especially true with regard 
to thinking about research. Toward this end, 
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we have found it advisable to have frequent 
conferences devoted either to ongoing re- 
search or published research. Even in those 
settings that may not be able to integrate 
internes into ongoing research, it seems 
advisable that this provision for regular 
discussions should become _institutionalized 
as part of the program. 

One problem that presents itself in many 
clinical settings is that of providing time for 
research activity in the face of service de- 
mands. We have utilized the technique of re- 
leasing the intern every third week from diag- 
nostic responsibilities in order to permit him 
to work on research. The importance of pro- 
viding time is that it is a concrete demonstra- 
tion of the department’s and the agency’s 
attitudes toward research. This open accept- 
ance of research activity as a legitimate func- 
tion of the clinician will minimize the anxiety 
or guilt in the individual who feels that he is 
sacrificing services for patients by engaging in 
research. 

Another problem with which we have strug- 
gled has been that of organizing interns so 
that they may participate in group research 
which provides a richer learning experience 
for the intern than he can normally expect 
from individual research. The opportunity for 
continued conferences on the group research 
introduces a significant dimension into the 
training of the intern that can hardly be 
matched when the intern is working alone on 
his own problem, even where he is being 
supervised. Working in the group on a group 
research does not minimize the opportunity 
for individual supervision or for contact with 
consultants, but rather adds to this the in- 
volvement with a group of other psychologists 
with whom the intern can interact on research 
in a continuous way over a year. Besides the 
potentially greater intellectual stimulation 
made possible by group research, there is the 
support that is provided in dealing with the 
many complex problems that arise in the 
course of research activity. 

The use of research consultants is felt to be 
a very valuable part of the research training 
experience for the intern. Not only do they 
add to the research atmosphere of the depart- 
ment, but also these people, when carefully 
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selected, can provide genuine help on research 
problems from the theoretical conceptualiza- 
tion to the technical aspects of designing the 
research and analyzing the data. The use of 
these consultants may also help the intern to 
understand that there may not be as great a 
cleavage between the university research psy- 
chologist and the “field” clinician as many 
interns seem to feel. By observing the fact 
that these two can communicate, share, and 
sometimes collaborate to their mutual benefit, 
the intern learns that there need not be this 
sharp schism between clinical activity and re- 
search as it is often defined in the university 
research center. 

A problem that must frequently be faced is 
the latent feeling in some interns that, if they 
engage in agency research, they will somehow 
be misused. This misuse may occur at two 
levels: (a) The intern may fear that he will 
be relegated to tasks that are either demean- 
ing or more appropriate to an individual with 
less training. This suggests that, in providing 
a training experience in research for the in- 
tern, it must be an experience that will be 
truly an educational one. He cannot be used 
simply as an assistant or as a clerical worker, 
but must share directly in the research from 
its conceptualization to its execution. (6) The 
intern may fear that he will not be rewarded 
for his contributions to the research in the 
form of authorship. In clinical activity, this 
concern is not present. Whether he is per- 
forming as a diagnostician or therapist, the 
intern is explicitly recognized for his services. 
Since the diagnostic workup and the record of 
treatment of the patient in the medical file 
bear his name, this serves to provide the in- 
tern with formal recognition of his labors. In 
research, this immediate recognition is not 
readily apparent. It has been our experience 
that with some interns this is a significant 
concern that may contribute to the intern’s 
reluctance to become involved in hospital re- 
search. We have found that it is well to be 
explicit with interns with regard to the re- 
wards that they may expect from participa- 
tion in the research. This has most usually 
meant that the intern is accepted as a collab- 
orator from the point of view of authorship. 

With the provision for the intern to work 
on a clinical research problem during the in- 
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ternship, there carries with it a requirement 
that such research should be supervised by 
someone who not only can assess the research 
from a technical point of view (design, sam- 
pling, etc.), but can also make the research as 
clinically meaningful as possible. It is the rare 
intern who can function independently with 
regard to research activity, even when he has 
already completed his doctoral dissertation. 
We have recently introduced, on a trial basis, 
a research training program for interns which 
provides for the joint supervision of interns’ 
research by a university experimentalist and 
by a clinician. 

Part of the responsibility of the research 
worker is to communicate his research and its 
findings effectively both in written and in oral 
form. It is part of the training of the intern 
that once he is involved in his research he is 
expected to prepare reports to be presented at 
group conferences at which consultants may 
be present. 

CONCLUSIONS 

It may be possible to reconceptualize the 
problem of how one trains the intern to per- 
form a research role effectively. In a sense, 
we may say that the individual learns to do 
research most effectively when he is globally 
involved in this process of learning. That is, 
he learns first by feeling that research is an 
activity personally and professionally reward- 


ing and by feeling comfortable in performing 
this research role because it does not conflict 
with other roles or with his own self-concept; 
he learns by thinking research, by being in- 
volved in activities that will permit him to 
extend his intellectual horizons with regard to 
research and particularly clinical research; 
and finally, he learns by doing research, by 
actually becoming involved in research ac- 
tivity. Our emphasis here has been that the 
internship’s responsibility with regard to de- 
veloping the research role of the clinician 
must be to permit the intern to begin to feel 
like a research person, to provide experiences 
that will extend his thinking about clinical 
problems and methods for dealing with them, 
and to provide an experience of doing in 
which he may practice the skills of a research 
worker. 
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COMPARABILITY OF INTELLIGENCE QUOTIENTS 
OF MENTAL DEFECTIVES ON THE WECHSLER 


ADULT INTELLIGENCE SCALE AND THE 


1960 


REVISION OF THE STANFORD-BINET’ 
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\ number of studies comparing the 
Wechsler-Bellevue (W-B) and the Wechsler 
Intelligenée Scale for Children (WISC) with 
the Stanford-Binet (S-B) have been reported. 
Correlation coefficients between IQs from 
these Wechsler scales and the S-B have been 
of the same order (.6 to .9) in normal, neuro- 
psychiatric, and mentally retarded popula- 
tions (Alderdice & Butler, 1952; Benton, 
Weider, & Blauvelt, 1941; Goldfarb, 1944; 
Gothberg, 1949; Mitchell, 1942: Nale, 1951; 
Sandercock & Butler, 1952; Stacey & Levin, 
1951; Wechsler, 1944). Although some stud- 
ies have indicated IQs from the WISC to be 
slightly higher than those from the S-B, the 
two instruments give fairly comparable scores 
(Frandsen & Higginson, 1951; Gehman & 
Matyas, 1956; Harlow, Price, Tatham, & 
Davidson, 1957; Weider, Noller, & Scramm, 
1951). A number of studies have indicated 
that W-B IQs are consistently higher than 
S-B IQs of older subjects (Halpern, 1942; 
Mitchell, 1942) and that this discrepancy be- 
comes larger as age increases and intellectual 
level decreases (Bensberg & Sloan, 1950; 
Kutash, 1945; Mundy & Maxwell, 1958). 

Only one study, that of Wechsler (1958), 
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compares the Wechsler Adult Intelligence 
Scale (WAIS) with the S-B. In a sample of 
52 male prisoners, aged 16 to 26 years, the 
mean S-B IQ was five points higher than the 
mean WAIS IQ, and the correlation between 
the two IQs was .85. The present study com- 
pares adult hospitalized retardates of varying 
age and intelligence level as to their IQs on 
the WAIS and the 1960 revision of the Stan- 
ford-Binet Form L-M. 


METHOD 


The 
subjects 
18 year 


sample consisted of 18 retarded 
tate hospitals who were 
i ] of familial 
ion. The sample 
1al age groups and three 
ler to obtain 12 equal- 


mentally 
in three Califor: 
or older and a diagnosis 
or undifferentiated n tardat 
was Classified into fc 
unequal IQ categorie 
1 subgroups of 15 s each. The age group- 
} 45-54, and 55-73 
based on the 
46 and below 
» WAIS first an 
ven subject, 
ts. A coun- 
ployed in assigning 
in order to control for 
possible e: lo control for changes 
>Thus subjects with known brain damage 
excluded from the stud; 
On the WAIS, a subject must 
Total Scaled score of 


table of norm 


were 


achieve a minimal 
11 to obtain an IQ from the 
Consequently, any subject achieving 
less than this minimum score was excluded from the 
sample since only prorated 


IQs could be estimated 


and presumably the WAIS would not be an appro- 
priate test for such a sub 

4 This 
adopted in order to circu 
tistical 
occur if IQ level wv 


procedure for determining IQ level was 
ivent the problem of sta- 
mean which would 


rmined by only one test 


regression tow a the 
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TABLE 1 


F WECHSLER ADULT INTELLIGENCE SCALE 
-Brvet IQs ror Eacu AGE LEvEI 


WAIS IQ 
Mean SD 


58.42 8.40 
59.44 10.20 
60.29 8.31 


63.56 9 88 


which might occur in intellectual functioning over an 
extended period of time, all subjects were given both 
tests within a 12-month period, with the great ma- 
jority of subjects both test 
month period 


receiving within a 4- 

To secure data that might help in developing an 
hypothesis regarding the relation of social com- 
petency to intellectual functioning on the WAITS and 
S-B, all subjects were administered the Vineland 
Scale of Social Maturity (Doll, 1953). In addition, 
an adapted form of a Scale of Minimal Social Be- 
havior (Farina, Arenberg, & 1957) was ad- 
ministered, but because of the lack of variability in 
scores from this instrument no us¢ 
of the results 


Guskin, 


could be made 


RESULTS 


\n analysis of variance of the difference 
scores between WAIS and S-B IQs was per- 
formed. The results indicated that age was 
significant in determining the magnitude of 
the discrepancy between the two IQs (F 

16.74; df = 3; p< .001), but neither IQ 
level nor the interaction between IQ level and 
age was significant.® 

Table 1 shows the comparison of WAIS 
and S-B IQs for each age level. Of the 180 
subjects examined only 3 had an S-B IQ 
higher than their WAIS IQ, the largest dif- 
ference being five points. The ¢ values for the 
significance of the difference between corre- 
lated means indicated that the mean WAIS 
IQ was significantly larger (beyond the .001 
level) than the S-B IQ at each age level. The 
correlation coefficients WAIS and 


5 Analyses of the difference scores between WAIS 
Verbal IQ and S-B IQ, and between WAIS Perform- 
ance IQ and S-B IQ were also performed. The results 
were virtually identical with the analysis of the dif- 
ference scores between WAIS and S-B IQs. More- 
over, these two sets of difference scores were of the 
same magnitude as difference scores between WAIS 
Full Scale IQs and S-B IQs 


between 


44.02 
45.84 
43.51 8.61 


1.00 8 33 


Mean 


975 


10.99 


15.00 
12.04 
18.64 


129? 


S-B IQs ranged from 
square test (Edwards, 
the differences 
nificant (, 


736 to .77 
1950) 
them 


df 3° p 


7 and a chi 
indicated that 
were not sig- 


>» 95). 


among 

Table 2 sets forth the difference scores be- 
tween WAIS IQ and S-B IQ by age level. The 
t tests indicated that the mean difference 
scores for the three younger age groups were 
significantly smaller (< .01) than for the 
oldest age group. Between the ages of 18 and 
54 years, the WAIS IQ averaged 15 points 
higher than the S-B IQ, whereas in subjects 
over 55 years the difference averaged 23 
points. The standard deviation of the differ- 
ence score was approximately 6. The WAIS, 
unlike the S-B, takes systematic account of 
the lower performance of older adults; hence, 
the older the adults (beyond a peak between 
25 and 30) the lower the absolute 
performance and hence the larger 
crepancy S-B WAIS. As the 
WAIS standardization population included 
adults of a wide age range whereas the S-B 
did not, it would be logical to assume that the 
WAIS IQ is a more accurate measure of in- 


level of 
the dis- 


between and 


rABLE 2 


DIFFERENCE Scor! BETWEEN WE 
INTELLIGENCE SCALE IQ 


HSLER ADULT 
AND STANDARD-Binet IQ 


, 


~—S tos 


>> 


5 to 32 


10 to 36 


ote For 
the WAIS 10 
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TABLE 3 


COMPARISON OF IQs FROM THE WECHSLER ADULT INTELLIGENCE SCALE AND STANFORD-BINET WITH 


SociaAL AGES FROM THE VINELAND SCALE OF SOCIAL 


WAIS 
Mean IQ 


S-B . 
Mean IQ Mean Age 
58.42 
59.44 
60.29 
63.56 


44.02 10 
45.84 10 
43.51 9- 
41.00 g- 


telligence. It has been suggested, however, 
that there is a spurious allowance for “nor- 
mal” deterioration with age for retardates on 
the Wechsler scales (Bensberg & Sloan, 
1950). Until validity studies, preferably of 
the longitudinal type, of the WAIS with re- 
tarded populations are made, the problem of 
differential rate of intellectual decline depend- 
ing on level of intellectual functioning re- 
mains unanswered. 

The regression equations for predicting the 
IQ on one test from that on the other for the 
three younger age groups appeared sufficiently 
similar to warrant the use of the following 
regression equations for subjects between 18 
and 54 years: 


Predicting WAIS IQ from S-B IQ: 
Y = .68X + 29.15 
(SE. = 6.15) 
Predicting S-B IQ from WAIS IQ: 


Y = .79X — 2.45 


(SE, = 6.61) 


For subjects 55 years and older, the following 
regression equation should be used: 


Predicting WAIS IQ from S-B IQ: 


Y = .92X + 25.84 
(SE, = 6.22) 


Predicting S-B IQ from WAIS IQ: 


Y= 65X — 31 


(SE, 


Vineland Scale of Social Maturity 


MATURITY 


Correlation 
Coefficients 


WAIS 
IQ & 
Social 
Age 


S-B 
IQ & 
Social 
SD Range 
1-6 5-6 to 13-3 
1-9 5-5 to 13-0 
2-0 0 to 12-10 
2 


1 5-5 to 13-3 


265 
438 
638 


534 


In order to gain some knowledge of the 
relation between social competency and the 
IQs from the two intelligence scales, com- 
parisons of the Social Ages (SA) from the 
Vineland Scale of Social Maturity were made 
with the IQs from the WAIS and the S-B. 
These data are presented in Table 3. Chi 
square tests indicated that the correlation co- 
efficients between WAIS IQ and Vineland SA 
and between S-B IQ and Vineland SA were 
homogeneous for the four age levels (,’ 

- 5.249 and 9.264, respectively: df = 3: p 
> .01). For the total sample the correlation 
coefficient between Vineland SA and WAIS 
IQ was .450, and between Vineland SA and 
S-B IQ was .405. A nonsignificant ¢ value of 
.939 for the significance of the difference be- 
tween correlated correlations indicated that 
the WAIS and S-B correlated equally well 
with Vineland SA. The data shown in Table 3 
suggest a trend for WAIS IQs to increase, and 
S-B IOs and Vineland SA to decrease, with 
age. According to analyses of variance these 
observed trends all proved to be nonsignifi- 
cant at the .01 level (WAIS: F = 2.54; S-B: 
F=2.01; Vineland SA: F =1.29). More 
thorough investigation would be necessary in 
order to answer the question of the relation 
between social competency and intelligence as 
measured by these two scales. 


SUMMARY 


This study sought to determine the effect 
of age and level of retardation on the com- 
parability of IQs from the Wechsler Adult 
Intelligence Scale and the 1960 revision of the 
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Stanford-Binet. In addition, a measure of 

social competency was related to the IQs 

from the two scales. It was determined that 

age, but not level of retardation, was signifi- 

cant in determining the magnitude of the 

difference between WAIS and S-B IQs. WAIS 

IQs averaged 15 and 23 points higher than 

S-B IQs for subjects 18-54 years and 55-73 

years, respectively. Regression equations were 

calculated to translate the IQ from one test to 

the other test. The WAIS and S-B IQs cor- 

related equally well with Social Ages from the 

Vineland Scale of Social Maturity. 
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The ability to judge correctly the sizes of 
objects despite variation in viewing distance, 
i.e., size constancy, represents an important 
biological achievement of living organisms 
and has been studied in relation to the under- 
lying mechanisms as well as to psychopathol- 
ogy. In particular, the possibility that size 
constancy differs among schizophrenics as 
compared with normals has been of interest 
to a number of investigators and theorists 
(Lovinger, 1956; Maes, 1957: Raush, 1952, 
1956; Reynolds, 1954; Sanders & Pacht, 
1952; Weckowicz, 1957). Bruner (1951) 
states the theoretical problem most succinctly 
when he suggests that a withdrawal from ob- 
ject relations and an increasing concern for 
the self might lead to a breakdown in per- 
ceptual constancy. However, as pointed out 
in a recent summary (Rabin & King, 1958), 
there is little agreement among the results of 
previous experiments. Sanders and Pacht 
(1952) found “overconstancy” among a schiz- 
ophrenic group as compared with normals, 
while Weckowicz (1957) reports the opposite 
relationship. Raush (1956) found no differ- 
ences between nonparanoid schizophrenics, al- 
though his paranoid group did differ from 
normals. 


The purpose of the present experiment is 
to re-examine this problem, utilizing a group 


of chronic, undifferentiated schizophrenics 
chosen from the patient population of a neu- 
ropsychiatric hospital to be most “withdrawn” 
with respect to their behavior. It is felt that 
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Administration Hospital, Tomah, Wise 


the selection of the ext 
a group of chronic patients whose diagnosis is 
more reliable provides an excellent test of the 
hypothesis under consideration. The method 
of testing has been modified after techniques 
which have been employed in experimental 
laboratories which a large body of 
normative data is already available (Holway 
& Boring, 1941; Leibowitz, in press; Leibo- 
witz, Chinetti, & Sidowski, 1956). Essentially, 
the technique requires the subject to signal 
which of two sticks larger, a simple task 
which has been used successfully with chil- 
dren and feebleminded groups. Data are ob- 
tained over a wide range of viewing distances, 
thus providing the for a more complete 
analysis of the size matching—distance rela- 
tionship. 


me cases from among 


and for 


Subjects 


The members ental group were 35 
male patients of the Administration Hos- 
pital in Tomah, Wisconsin, carrying a diagnosis of 
chronic, undifferentiated sct There was no 
history of brain damage, organic pathology, or visual 
defect. None of the p receiving EST, 
although many of them were on different types of 
drug therapy. They were selected by the psychiatric 
ward team to be mainly characterized by withdrawal 
symptoms. The patients was 
39.41 years with a rang 24 years to 56 years 
The average length of ps tric hospitalization was 
8.84 years with a range 2 years to 13 years. 

The members of the cont group were 20 psychi- 
atric aides and were sek random. Their mean 
age was 38.52 with a range of from 25 years 
to 54 years. 


izophrenia 


itients were 


mean age of the 35 


years 


Experimental Condition 


The experiment was 
ridor 9% feet 


jucted in a hospital cor- 


wide high, and 135 feet in 
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th. Large windows were located every 12 feet in 
th walls of the corridor and all tests were made in 
light. The test objects consisted of five wooden 
l H 2, 4, 8, 16, 

‘ 


! » lengths of 
24 inches. These were mounted in square bases 


ut from 1 inch stock t 
inches by 4 inches and were painted “flat” black. 
the test objects was propor- 
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yy each object, and the size of the corre- 
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all distances of observation 


schizophrenics 
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a condition which would obtain if judged size 
depended only on retinal image size. In the 
present study, retinal image size was constant 
so that matched size, according to this predic- 
tion, would be the same for all test objects 
and viewing distances. The other theoretical 
extreme, the “law of size constancy,” is 
sented by the diagonal line. According to this 
prediction, perceived size is independent of 
distance so that matched size would be equal 
to the actual size of the test objects. As is 
typical in such experiments conducted with 
binocular vision, adequate illuminance, and a 
well articulated visual field, the data for nor- 
mals lie very close to the prediction in terms 
of size constancy. The data for the schizo- 
phrenics also lie close to this theoretical line 

The nature of the data indicated the use of 
The Mann-Whitney U 
test was applied (Siegel, 1956). From this 
analysis it was concluded that size judgments 
of the schizophrenic group do not differ sig- 
nificantly from the normal sample on this 
task (U = 119, p < .424). In addition, re- 
spective variabilities of the two groups were 
compared. As expected, none of the differ- 
ences reached significance. 


repre- 


a nonparametric test. 


DISCUSSION 


The results of the present study indicate 
no difference between the ability of chronic 
schizophrenics and normals to judge correctly 
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the sizes of test objects from 10 to 120 feet. 
Both groups exhibit a high degree of size con- 
stancy. These data are in agreement with 
those of Raush (1952), who found essentially 
the same results with nonparanoid schizo- 
phrenics. The disagreement of the present 
data, as well as those of Raush, with other 
investigators could well be a result of differ- 
ences in the patient population (e.g., Sanders 
& Pacht, 1952, tested outpatients) or in the 
procedure employed. In any event, it is im- 
portant to recognize that even in those studies 
which do differentiate between normals and 
schizophrenics the magnitude of the differ- 
ences are small in relation to differences found 
as a result of variation of less subtle variables, 
such as the instructions given the subjects 
(Gilinsky, 1955; Holaday, 1933), or the age 
of the subjects (Beryl, 1926; Zeigler & Lei- 
bowitz, 1957). 

The present data have implications in rela- 
tion to psychopathology, as well as to a 
theoretical understanding of size constancy. 
With respect to the chronic schizophrenic 
group tested, it is clear that whatever dis- 
orders of perception, thinking, or behavior 
they suffer from, their size constancy is un- 
affected. Indeed, it would be difficult to imag- 
ine such patients, who move about the hos- 
pital grounds and engage in sports, judging 
incorrectly the sizes of environmental objects. 
The difficulty with schizophrenics, as seen 
clinically, may not be in relation to neutral 
objects, such as the sticks utilized in this 
study, but to other individuals and to their 
own patterns of thinking and emotions. Ap- 
parently, the withdrawal characteristic of 
schizophrenia, in general, and of the subjects 
utilized in this study, in particular, may be 
significant in interpersonal relationships only, 
and are not indiscriminately employed as 
defenses in purely nonaffective areas, such 
as size judgment. As suggested by Bruner 
(1951), one would be more likely to find 
differences between schizophrenics and nor- 
mals if the “test objects” were human beings. 

The present size matching data are also of 
importance in relation to the mechanisms 
which subserve the size constancy effect. It 
has been demonstrated that the size con- 
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stancy of children for near objects is good, 
but that the ability to judge correctly the 
sizes of distant objects develops slowly (Zeig- 
ler & Leibowitz, 1957). These data suggest 
that there are a number of separate mecha- 
nisms involved in size constancy, the nearer 
distances being mediated by the kinesthetic 
cues of accommodation and convergence, the 
farther by a more slowly developing percep- 
tual learning process (Leibowitz & Moore, 
1960). The available data indicate that this 
learning process is complete by the early 
teens or, in any event, since the patients in 
the present study were veterans, before the 
age at which the subjects tested were hos- 
pitalized. The lack of any difference between 
the groups in the present experiment suggests 
that the mechanisms underlying size con- 
stancy for neutral objects, such as used in 
this study, are independent of personality 
changes even as severe as those encountered 
in schizophrenia. This conclusion is in agree- 
ment with similar studies which demonstrate 
that the development of size constancy is in- 
dependent of intellectual processes as indi- 
cated by the lack of differences between 
feebleminded and normal subjects on size 
matching tasks (Jenkin & Morse, in press; 
Leibowitz, in press). Thus, it would appear 
that the development of size constancy, al- 
though closely related to the age of the sub- 
ject, is independent of mental and personality 
development. 


SUMMARY 


The ability to judge object size as a func- 
tion of distance was determined for a group 
composed of chronic, undifferentiated schizo- 
phrenics, as well as a control group of psychi- 
atric aides. 

There were no significant differences in the 
matches produced by the two groups. Both 
judged correctly the sizes of the test objects 
at all distances. 

It is suggested that the absence of any dif- 
ferences is due to the fact that size matching 
requires abilities which are fully developed 
prior to the onset of schizophrenia and which 
are unaffected by the characteristic with- 
drawal observed in this pathology. 
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THE RELATIONSHIPS BETWEEN 





INDIVIDUALLY DEFINED 


AND GROUP DEFINED SOCIAL DESIRABILITY 
AND PERFORMANCE ON THE EDWARDS 
PERSONAL PREFERENCE SCHEDULE 


ALFRED B. HEILBRUN, Jr. 


State Unit 


Since its publication in 1954, the Edwards 
Personal Preference Schedule (EPPS) has 
been the focus of considerable research bear- 
ing upon the problem of social desirability as 
1 source of response variance. The EPPS was 
constructed so that each item consists of a 
pair of statements matched for social desir- 
ability, thus presumably forcing the test taker 
to respond in accord with his actual behav- 
ioral characteristics rather than the social 
desirability of the statements. To the extent 
that Edwards (1959b) is correct in consider- 
ing social desirability as contributing prin- 
cipally to measurement error, studies of the 
relationship between social desirability and 
EPPS performance are important in an over- 
all assessment of the test’s validity. However, 
the many investigations have provided rather 
inconsistent results and conclusions. 

Several studies have concluded that social 
desirability has been eliminated as an im- 
portant determinant of performance on the 
EPPS. Navran and Stauffacher (1954) found 
a near-zero correlation between social desir- 
ability ratings of the traits being measured 
and scores on the EPPS scales. Borislow 
(1958) reported that subjects were able to 
change their test profiles under both social 
and personal desirability faking sets but con- 
cluded that desirability had been eliminated 
as a source of conscious response manipula- 
tion on the EPPS because no consistent scale 
patterns among the subjects were obtained 
under either set. Finding only a “slightly 
greater than chance” number of significant 
point biserial correlations between the sub- 
jects’ Social Desirability scale scores on the 
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ersity of lowa 


MMPI and frequency of choice of the first or 
second statement in each item pair of the 
EPPS led Kelleher (1958) to conclude that 
social desirability plays an insignificant role 
in EPPS performance. 

In contrast to these negative findings, sev- 
eral investigators have reported results which 
indicate that desirability remains an impor- 
tant source of response variance on the EPPS. 
Heilbrun (1958) found a .60 correlation be- 
tween EPPS scale scores and personal desir- 
ability of the test scales. 
Cohen, Gruen, 
obtained a 


Corah, Feldman, 
Meadow, and Ringwall (1958 
.88 correlation between the per- 
centage of college subjects judging the first 
statement in an item as more socially desir- 
able and the percentage of similar subjects in 


an independent group who endorsed that 
statement as_ self-characteristic. Edwards 
Vright, and Lunneborg (1959), pointing out 


that Corah et al. used only 30 EPPS items 
taken out of context, have repeated the pro- 
cedure using all items in the test. They report 
somewhat lower correlations (.69 and .61) for 
two samples, but the relationship between de- 
sirability and endorsement still remained. 
One common characteristic of all of these 
studies described above is that they have 
sought to evaluate the relationship between 
desirability and EPPS performance using 
some type of group desirability statistic (e.g., 
group average or group percentage) in the 
analysis. It is possible that if the relationship 
between each subject’s individual desirability 
ratings of the statement alternatives in the 
EPPS and his actual statement selections were 
determined, a more accurate inference could 
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be made regarding the influence of social de- 
sirability. The logical expectation would be 
that if social desirability is related to EPPS 
performance, this should be 


using an 


more evident 
individualized approach, since the 
individual desirability values which each sub- 


‘ 


ject assigns to test statements should be more 
relevant in predicting Ais statement endorse- 


ments than would averaged group values. A 
recent study by Taylor (1959), however, 
found the opposite to be the case. He deter- 
mined the correlations between both individ- 
ual and group social desirability values and 
endorsement of MMPI items for schizo- 
phrenic patients and found the group-aver- 
aged values correlated considerably higher (r 

.79) with performance than did the indi- 
vidual values (r = .36). 

The present study had three purposes: (a) 
to relate individual social desirability values 
and EPPS performance to clarify the extent 
to which individual social desirability set con- 
tributes to response selection; () to compare 
the prediction of EPPS responses from indi- 
vidual social desirability values with predic- 


tion of these responses from group social 
desirability values; and (c) if individual 
values are found to be related to overall 


EPPS performance, to evaluate the specific 
relationship between individual desirability 
and each EPPS scale. 


METHOD 
Subjects. The 58 subjects used in this study were 
obtained from a large undergraduate psychology 
course at the State University of Iowa. This sample 
included 29 males and 29 females 

Test. The EPPS (Edwards, 1959a) is an objective, 
rationally derived, multivariate, personality question- 
naire. It includes 135 different statements bearing 
upon personality characteristics, 9 for each of the 15 
traits measured by the test. These statements are 
arranged in 210 pairs with each pair including 2 
statements matched for social desirability but meas- 
uring different traits. The final score for each trait 
represents the number of occasions out of a possible 
28 on which the subject has selected that trait state- 
ment as more characteristic than the 
ment. 

Procedure. All subjects were initially group ad- 
ministered the EPPS under standard instructions. 
About 2 months later the same subjects were given 
the 135 different statements included in the test and 
asked to rate each as a trait in other people on a 
nine-point scale from highly socially undesirable to 
highly socially desirable. This rating procedure was 


paired state- 


identical to that described by Edwards (1957) in his 
initial derivation of statement desirability values. 
Having obtained the subject’s EPPS responses under 
standard conditions and individual desirability rat- 
ings from the nine-point scale for both statements in 
each item pair, it was possible to determine the per- 
centage of times the subject 
acteristic 


endorsed as self-char- 
the statement having the higher individual 
social desirability value for him. Items for which the 
subject had ascribed individual social desir- 
to both statements were omitted from 


equal 
ability values 
the analysis 
The second step was to determine the percentage 
of times each subject endorsed as self-characteristic 
the more highly socially desirable statement in each 
item pair where the desirability values were those 
assigned statements by 
the test 


to the Edwards in his initial 


This percentage reflects the 


construction of 


corre sponde nce between EPPS response and group 
ocial desirability values obtained from an inde- 
pendent group of college students. There are 204 of 


the 210 EPPS items for which the 
the paired statements differ 

Finally, the values assigned to the nine statements 
measuring each of the 15 EPPS variables were av- 
eraged for each subject, and these mean values de- 
fined the subject’s individual social desirability rat- 
ings for the 15 traits. High and low social desirability 


groups were 


group values of 


then constituted independently for each 

mean scale for these high and 
low groups were compared to assess whether the re- 
lationship between individual desirability and EPPS 
performance varied over scales. Since there are estab- 
lished sex differences on the EPPS scales (Edwards, 
1959a), 


trait and the scores 


taken to 
imately equal numbers of mal 
desirability 


plished by 


precautions were insure approx- 


and females in both 
groups for each scale. This was accom- 
distributing individual social desirability 
scores for each scale separately by sex and defining 
high and low groups by cutting at or near the 
median for both sex distributions. The two high and 
two low desirability groups for each scale were then 
recombined into one high end one low group with 


nearly equal numbers of males and females 


RESULTS 


Individual social desirability values and 
EPPS performance. There was a mean of 
150.97 items out of a possible 210 in which 
the paired statements had discrepant individ- 
ual social desirability values for the 58 sub- 
jects in this study. For those items within 
which the statement values varied, the sub- 
jects averaged endorsing the individually more 
desirable statement as more self-characteristic 
67.16% of the time. This percentage value 
differs significantly from a 50% chance ex- 
pectancy (¢ = 10.34 for 57 df, p < .001) and 


clearly indicates that performance on the 
EPPS is related to the individual ratings of 
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TABLE 1 
COMPARISON BETWEEN EPPS Scate Scores ror THE 15 SEPARATE HIGH AND 
Low Socrat DesrrRaABiLity Groups 





High Social Desirability Low Social Desirability 
EPPS Scale Mean SD N Mean SD N is 
Achievement 15.69 5.55 29 13.52 4.76 29 1.58 
Deference 11.62 4.00 29 10.90 3.04 29 a7 
Order 11.86 3.95 29 8.83 3.58 29 3.03** 
Exhibition 15.93 2.27 29 14.14 4.58 29 b 
Autonomy 15.33 4.04 30 11.89 3.84 28 1.46 
Affiliation 17.17 3.25 29 14.72 3.87 29 2.58** 
Intraception 20.47 4.68 30 15.71 5.32 28 3.61** 
Succorance 11.20 4.69 30 10.71 3.68 28 44 
Dominance 16.86 3.60 29 12.97 4.82 29 3.44** 
Abasement 12.63 4.72 30 12.46 5.36 28 13 
< Nurturance 16.97 4.75 31 12.93 4.61 27 3.26** 
Change 17.39 4.65 28 15.43 5.14 30 1.51 
Endurance 14.26 4.16 27 11.23 4.31 31 2.71% 
Heterosexuality 18.24 4.94 34 15.00 5.01 24 2.36* 
Aggression 14.36 4.62 28 9.50 4.47 30 4.05** 
* One-tailed ¢ tests were used 
» The Cochran-Cox approximation was used because of heterogeneity of variance 


* Significant at .05 level 
** Significant at .01 level 


statement social desirability. For the 58 sub- 
jects the range of endorsement percentage in 
the individually socially desirable direction 
was from 35.03 to 79.66, the subject provid- 
ing the lower end of the range being unique 
in that he was the only subject to fall below 
the 50% mark. 

Individual vs. group social desirability 
values. When the group mean social desir- 
ability values of the statement alternatives 
were considered, it was found that the sub- 
jects endorsed the group-defined higher so- 
cially desirable statement on 55.80% of the 
204 imperfectly matched items. This result is 
very similar to that reported by Goodstein 
and Heilbrun (1959) who found, for an inde- 
pendent sample of 248 college subjects, that 
the group-defined more socially desirable 
statement was endorsed 55.92% of the time 
on the average. The 55.80% value found 
in the present study differs significantly (¢ 

= 8.99 for 57 df, p< .001) from a chance 
expectancy of 50%, clearly indicating that 
performance on the EPPS is also related to 
group-defined desirability of the statements. 

A comparison between the predictableness 
of EPPS statement endorsement from group 
social desirability values and from individual 


social desirability values was made, and it 
was found that the difference between the 
mean percentage of 67.16 for items in which 
the endorsed statement was the more indi- 
vidually desirable was significantly higher 
than the 55.80% of items in which the en- 
dorsed statement was group-defined more de- 
sirable (¢ = 9.86 for 56 df, p < .0O1). 
Individual social desirability and separate 
EPPS scales. Although it had been shown in 
this study that the individual’s judged desir- 
ability of the statement alternatives does 
show a significant relationship to statement 
endorsement on the EPPS, the question still 
remained aj to the extent of this relationship 
for the separate scales of the test. Data perti- 
nent to this question are presented in Table 1 
which gives the means and standard devia- 
tions on each of the 15 EPPS scales for the 
15 separate high and low social desirability 
groups and the results of ¢ test comparisons. 
It can be seen that the group of subjects rat- 
ing the trait statements as more highly desir- 
able had the higher mean on each scale when 
compared to the group rating the trait state- 
ments as less desirable, the differences being 
significant for 9 of the 15 comparisons and 
approaching significance at less than the .10 
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level on 3 more. Thus the positive relation- 
ship between individual desirability and en- 
dorsement appears to hold for most but not 
for all of the EPPS scales. 


DISCUSSION 


One major finding in the present study is 
that college subjects endorsed the individ- 
ually defined, more socially desirable response 
as self-characteristic on at least two out of 
every three items in which the social desir- 
ability values of the paired statements dif- 
fered. This would seem to clearly indicate 
that social desirability has not been elim- 
inated as an important source of performance 
variance on the EPPS as has been suggested 
by some previous investigators (Borislow, 
1958; Kelleher, 1958; Navran & Stauffacher, 
1954). The inconsistencies in results among 
the many investigators bearing upon this 
issue most likely has stemmed from the use 
of group social desirability statistics which, 
based on the results of the present study, are 
less likely to be related to the individual’s 
response endorsement than the desirability 
values for that particular individual. 

A second major interest in this study was 
whether an individual’s responses to a struc- 
tured personality questionnaire are more 
highly related to his own judged social desir- 
ability of the responses or to averaged group 
desirability values. Taylor’s (1959) recent 
finding that group values were more highly 
related appeared to be inconsistent with the 
logical assumption that the closer individual 
parameters of behavior are approximated, the 
more effectively can behavior be predicted for 
the individual. The results of the present 
study were not in agreement with those of 
Taylor, since it was found that college sub- 
jects selected the statemént alternative on the 
EPPS which was individually more desirable 
more than 67% of the time while these same 
subjects endorsed the group-defined more de- 
sirable statement on less than 56% of the 
items. Taylor suggested that the lesser rela- 
tionship obtained between individual social 
desirability values and MMPI performance 
than between group values and performance 
could be attributed in large measure to the 
lower reliability of individual values which, 
in turn, would serve to more seriously atten- 
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uate this correlation. If this is true, it should 
be pointed out that the opposite results were 
found in the present study despite the lower 
reliabilities of the individual social desirabil- 
ity values. Since there were so many proce- 
dural differences between the Taylor study 
and the present one (e.g., nature of test item 
content, true-false vs. forced choice format of 
the tests, nature of the subject population, 
etc.,) further speculation regarding the differ- 
ential results appears unwarranted at the 
present time. 

The analysis of relationships between indi- 
vidual desirability and specific scores suggests 
a fairly general positive relationship over 
scales. An interesting finding was that the 
three scales which appear to be unrelated to 
social desirability (Deference, Succorance, 
and Abasement) have a common psycholog- 
ical element: their characteristic behaviors 
involve some type of subordination of the 
individual to another person. This suggests 
the hypothesis that the young adult college 
subjects used in this study, being in transition 
from a period of subordinate childhood rela- 
tionships into a period of more superordinate 
or coequal adult relationships, have not as yet 
stabilized their value systems regarding sub- 
ordinate, equal, and superordinate interper- 
sonal roles. A consequence of this would be 
that what the subject perceived as socially 
desirable and self-characteristic relative to 
role-related behaviors would be at least tem- 
porarily unrelated. 


SUMMARY 


This study investigated three issues relative 
to performance on the Edwards Personal 
Preference Schedule: (a) the relationship be- 
tween the social desirability of statement 
alternatives on the questionnaire and the 
endorsement of statements as self-character- 
istic when the individual’s own desirability 
values are used as predictors, (5) the predic- 
tion of statement endorsement using the in- 
dividual’s social desirability values for the 
statements as opposed to prediction using 
group-defined values for the statements, (c) 
the differential relationships between the in- 
dividual’s social desirability values and per- 
formances on the separate scales of the EPPS. 
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The results indicated that the individual’s 
social desirability set is an important source 
of variance in EPPS performance, since col- 
lege subjects endorsed as self-characteristic 
the more highly valued statement at least two 
out of every three occasions when the state- 
ment alternatives were assigned different in- 
dividual social desirability values. It was also 
found that individual social desirability values 
were more highly related to EPPS perform- 
ance than were group desirability values, a 
finding which differs from that reported by 
Taylor (1959) who used the MMPI as the 
performance variable. Finally, analysis by 
separate scales suggested that individual so- 
cial desirability set is related to most but not 
all of the EPPS variables. 
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BEHAVIOR PROBLEMS OF MIDDLE CHILDHOOD 
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Before the etiology and treatment of chil remaining variables was determined exclusively by 
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1 ; the 58 most common problems were selected for gen- 
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eral investigation 





fined. For the sake of general y and de scrip In use. the variables were ordered randomly, as 
tive efliclency, any concepts empio' ed in such embled in a format requiring ratings of 0 (no prob- 
cle fir it on should be non irbitrar' unitarv. lem), 1 mild problem), or 2 (severe pr y| lem), and 


: : —— 1 ibmitted for completion to 28 teachers of 83 n 
ind independent. Factor analytic methods ‘ _ nesnai : yee . kil 
‘ . lergarten and elementary school children in six dif 
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(Peterson, 1960). Out of personal curiosity, five- 
factor solutions were also tried for each data set; 
but these, as expected, were much less stable over 
age than the two-factor solutions, and only the latter 
will be reported. 

Factor scores were computed for all cases by un- 
weighted summation of pertinent problems checked 
by the teachers. Interjudge correlations were com- 
puted for the kindergarten sample, and further at- 
tention directed toward comparing the four age 
groups. Data for boys and girls were separated in 
all comparisons, because of well known sex differ- 
ences in problem expression, and mean factor scores 
were computed to show trends in the development 
of behavior problems over the years of middle child- 
hood 


RESULTS 
The Factors 


All four sets of rotated factor loadings are 
presented together in Table 1, an arrange- 
ment permitted only by the marked similarity 
between results at the four age levels.* Fac- 
tor 1 is obviously a conduct problem dimen- 
sion, closely resembling the like-named factor 
isolated by Himmelweit (1953) and “unso- 
cialized aggression” as defined by Hewitt and 
Jenkins (1946). Factor 2 has been labeled 
personality problem in accordance with Him- 
melweit’s designation and common usage. It 
is much like the “over-inhibited behavior” 
dimension which Hewitt and Jenkins found. 
Actually these terms, “personality problem” 
and “conduct problem,” are grossly inappro- 
priate. Both problems are personality expres- 
sions, and both affect conduct. But the cen- 
tral meanings seem clear enough. In one case, 
impulses are expressed and society suffers; in 
the other case impulses are evidently inhibited 
and the child suffers. 

The generality of these factors appears to 
be enormous. Not only do they emerge with 
striking uniformity over the limited age range 
and the particular variables and subjects ex- 
amined here; they have appeared in very 
much the same form with the recorded prob- 


2 The rating schedule, correlation matrices, and un- 
rotated centroid factor matrices have been deposited 
with the American Documentation Institute. Order 
Document No. 6632 from ADI Auxiliary Publications 
Project, Photoduplication Service, Library of Con- 
gress; Washington 25, D. C., remitting in advance 
$2.00 for microfilm or $3.75 for photocopies. Make 
checks payable to: Chief, Photoduplication Service, 
Library of Congress. 
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lems of treatment cases (Hewitt & Jenkins, 
1946; Himmelweit, 1953), and remarkably 
similar factors have appeared in the question- 
naire behavior of delinquent boys (Peterson, 
Quay, & Cameron, 1959). Considering all 
studies together, age has varied from early 
childhood to adolescence; problem status has 
varied from none, through clinic attendance, 
to incarceration for delinquency; data sources 
have varied from case history records, to 
standard ratings, to questionnaire responses; 
methods of factor extraction have varied from 
cluster inspection to centroid analysis; rota- 
tional methods have varied from none, through 
visual shifts to both orthogonal and oblique 
solutions, to analytic techniques. Through it 
all, the factors have stayed the same, and 
their definition at last seems adequate. The 
time is ripe for study, particularly experimen- 
tal study, of dynamics, etiology, and treat- 
ment. 


Factor Scores and Their Reliability 


Such investigations, however, cannot pro- 
ceed until various properties of the measuring 
devices have been examined. Factor scores 
were computed by unweighted summation 
over the first 15 variables for each factor as 
listed in Table 1. Below that point, many of 
the variables either have no appreciable load- 
ing on either dimension, or approximately 
equal loadings on both. The former condition 
holds especially for skin allergy, hay fever, 
nausea, and stomach-aches, which may often 
be purely somatic and qualitatively distinct 
from the other variables examined. The latter 
condition, roughly equal loadings on both fac- 
tors, holds for crying, nervousness, and cer- 
tain school attitudes, variables which are 
either very general in nature or exhibit some 
kind of developmental change. 

Reliability and interfactor correlation were 
examined for the kindergarten group only, 
since only for that group were dual ratings 
available. Interjudge r’s of .77 and .75 were 
found for Factors 1 and 2, respectively. These 
figures are exceptionally good for ratings, and 
are sufficiently high for most research pur- 
poses. The correlation between factors was 
.18, low enough to meet most requirements 
for independence. 
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TABLE 1 
Rotatep Factor LOApDINGsS 


Conduct Problem Personality Problem 


Factor K 1-2 3-4 5-6 Ke 1-2 3-4 5-6 
Conduct Problem 
Disobedience 74 77 69 86 03 04 07 11 
Disruptiveness 73 67 66 76 —0O4 19 —03 11 
Boisterousness 68 63 67 68 —16 07 -07 —09 
Fighting 54 73 61 77 —09 —(04 11 07 
Attention-seeking 54 67 63 76 12 10 07 02 
Restlessness 64 58 62 71 04 24 06 20 
Negativism 56 64 60 70 12 27 20 15 
Impertinence 57 57 53 76 02 08 00 08 
Destructiveness 59 65 51 65 —05 27 19 00 
Irritability 53 59 57 69 01 11 04 07 
Temper tantrums 54 37 49 OF O08 11 22 16 
Hyperactivity 51 49 54 49 06 12 00 03 
Profanity 30 42 64 60 —07 11 02 00 
Jealousy 23 50 41 56 06 10 12 11 
Uncooperativeness 67 67 53 71 09 31 38 21 
Distractibility 56 57 61 72 29 42 32 26 
Irresponsibility 60 65 49 65 22 18 47 20 
Inattentiveness 54 61 36 69 39 30 57 28 
Laziness in school 14 590 36 37 29 36 cc 31 
Shortness of attention span 48 54 31 oO 37 34 55 29 
Dislike for school 38 40 32 41 06 26 54 13 
Nervousness 22 25 46 50 40 44 22 26 
Thumb-sucking 29 17 36 05 09 28 03 15 
Skin allergy 16 02 —20 —05 01 21 05 20 
Personality Problem 

Feelings of inferiority 12 25 13 17 59 56 66 62 
Lack of self-confidence 12 26 13 16 0 61 00 58 
Social withdrawal 03 08 04 05 50 64 61 60 
Proneness to become flustere: 07 28 15 24 54 59 00 58 
Self-consciousness 13 -(03 —15 16 55 oO 47 63 
Shyness 16 -18 —23 —13 62 57 50 51 
Anxiety 01 19 24 10 50 57 55 47 
Lethargy 06 22 01 31 52 47 61 43 
Inability to have fun 15 06 —14 —09 49 48 53 48 
Depression 00 20 04 29 47 43 64 42 
Reticence 06 20 08 14 45 43 OF 41 
Hypersensitivity 06 26 18 30 40) 53 54 46 
Drowsiness (02 09 09 29 39 418 45 41 
\loofness 16 (3 —(4 05 51 3? 50 31 
Preoccupation 09 12 23 7 47 57 64 41 
Lack of interest in environment 24 30 21 51 40 44 67 28 
Clumsiness 16 21 36 17 43 54 34 36 
Daydreaming 14 26 21 49 53 46 69 47 
Tension 21 31 39 39 41 62 27 41 
Suggestibility 04 29 31 52 41 42 48 30 
Crying 15 14 06 59 27 48 32 19 
Preference for younger playmates 28 21 23 14 08 45 37 32 
Specific fears 09 24 —02 —(4 24 47 20 20 
Stuttering 11 17 Os 02 27 35 29 16 
Headaches 19 21 00 00 07 46 22 27 
Nausea —10 23 07 —(2 01 14 38 37 
Truancy from school 27 07 04 22 00 20 39 35 
Stomach-aches 10 18 05 —06 —(1 30 38 29 
Preference for older playmates —14 05 26 16 01 38 16 01 
Masturbation O08 14 26 04 —18 40 04 17 
Hay fever or asthma 15 01 17 11 05 21 09 03 


Kindergarten. 
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Mean conduct problem scores. 


Developmental Changes 


Mean factor scores were computed for boys 
and girls in all age groups, and the results are 
shown in Figures 1 and 2. Throughout mid- 
dle childhood, boys consistently display more 
severe conduct disturbances than girls, pos- 
sibly as a function of constitutional differ- 
ences, but more likely in response to different 
levels of social expectancy and tolerance for 
misbehavior. An interesting reversal, however, 
occurs in the expression of personality prob- 
lems. Boys evidently start school with more 
personality problems than girls, but around 
the seventh or eighth year such problems be- 
come more plentiful among girls. Again, so- 
cial pressures for sex-type conformity seem 
the likeliest causal agents. Reasons for the ap- 
parent upswing in problems at the fifth and 
sixth grade level are obscure. The increase 
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Fic. 2. Mean personality problem scores. 


may arise from the early agitation of adoles- 
cence, and the difficulty this can bring about 
in our society. 


SUMMARY 


This study was designed to improve struc- 
tural definition of children’s behavior prob- 
lems and to examine 
lems 


changes in those prob- 
middle childhood. 
Teacher ratings of 58 clinically frequent prob- 
lems were obtained for 831 kindergarten and 
elementary school children, and four separate 


over the years of 


factor analyses were conducted, one for the 
kindergarten subjects and one each for chil- 
dren in grades 1-2, 3-4, 


tors emerged with 


and 5-6. Two fac- 
remarkable invariance in 
all four analyses. The first implied a tendency 
to express impulses against society, and was 
labelled “conduct problem.” The second con- 
tained a variety of elements suggesting low 
self-esteem, social withdrawal, and dysphoric 
mood. 


It was called “personality problem.” 


Both factors have now appeared in a number 


of studies despite wide differences in subjects, 


variables, and anlaytic procedures. 
Comparisons over age showed that boys 
displayed more severe conduct problems than 
girls at all age levels examined. Kindergarten 
and primary school boys also showed more 
severe personality problems than girls, but 
at the two highest age levels this trend was 
reversed, and girls displayed more personality 
problems than boys. 
The definition of both 
adequate. Reliable, independent measures of 


dimensions seems 


the factors can be obtained, and the way to- 
ward investigation of dynamics, etiology, and 
treatment now seems clear. 
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ATTRIBUTION OF TRAITS AND EMOTIONAL HEALTH 
AS FACTORS ASSOCIATED WITH THE PREDICTION 
OF PERSONALITY CHARACTERISTICS OF OTHERS’ 


MARVIN SPANNER 


University of California, Berkeley 


Many unsystematized explanations have 
been offered concerning the manner in which 
judges predict the personality characteristics 
of others. The present paper will focus on 
two theories currently popular. The first, 
based on an interpersonal theory of person- 
ality, has been most extensively represented 
by Harry Stack Sullivan. Sullivan’s (1947) 
ideas on the subject are reflected in his 
famous dictum: 


“It is not that as ye shall judge so shall ye be 
judged, but as you judge yourself, so shall you judge 
others; strange but true so far as I know, and with 
no exception.” 


The above quotation suggests that inter- 
personal prediction is based primarily on an 
attributive mechanism in which the state of 
the individual’s self-concept determines the 
quality of the interpersonal appraisal. One 


1 This paper is based in part on a thesis submitted 
in partial fulfillment of the requirements for the PhD 
in the Department of Psychology of the University 
of California, Berkeley. The writer wishes to express 
his appreciation and indebtedness to Harrison G. 
Gough, thesis advisor; Donald W. MacKinnon, di- 
rector of the Institute of Personality Assessment and 
Research; and Victor B. Cline. 

This research was supported in part by the United 
States Air Force under Contract No. AF 18 (600)-8, 
monitored by Technical Director, Detachment No. 7 
(Officer Education Research Laboratory), Air Force 
Personnel and Training Research Center, Maxwell 
Air Force Base, Alabama. Permission is granted for 
reproduction, translation, publication, use, and dis- 
posal, in whole and in part, by or for the United 
States Government. Personal views or opinions ex- 
pressed or implied in this publication are not to be 
construed as necessarily carrying the official sanction 
of the Department of the Air Force or of the Air Re- 
search and Development Command. 

2 Now at Neuropsychiatric Institute, University of 
California, Los Angeles. 


can deduce from his statement the following 
hypothesis : 


I. There is a positive relationship between 
accuracy in predicting the personality charac- 
teristics of others and similarity of person- 
ality characteristics of the judge and the 
individual being judged. This would logically 
appear to follow because an understanding 
of others is based, according to Sullivan, on 
one’s self. Thus, if one tends to perceive 
others as similar to one’s description of one’s 
self, then those actually similar will be per- 
ceived accurately. Lingren and Robinson 
(1953) make a similar point when they state, 
in criticizing Dymond’s study (1950), that 
“Conventional people get good scores on 
empathy tests because most of their partners 
(or referrants) in the test are also conven- 
tional.” 

The second group of explanations of pre- 
dictive ability involves the mental health of 
the individual. Cline (1953) in an exhaustive 
review of the literature suggests that, on the 
whole, a good judge of others is emotionally 
sound, has good interpersonal relations, is 
happier, more popular and flexible. In addi- 
tion, most clinicians assume that an emotion- 
ally healthy individual is a better judge of 
personality because he has fewer problems 
which interfere with his understanding of 
others. In the present instance, two measures 
have been utilized to evaluate the emotional 
health of the subjects. The first is the cor- 
relation between the judge’s evaluation of 
himself and his evaluation of the sort of 
individual he would ideally like to be. The 
second is a composite rating by experienced 


observers of the “soundness” of the judge. 
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We have, therefore, the following two hy- 
potheses relating the “mental health” of the 
judge and his ability to predict personality 
characteristics of others: 

II. There is a positive relationship be- 
tween the self—ideal-self correlation of the 
judge (called self-acceptance and used as a 
measure of mental health) and his ability to 
predict personality characteristics of others. 

III. There is a positive relationship be- 
tween the rated “soundness” of a judge (used 
as a measure of mental health) and his ability 
to predict personality characteristics of 
others. 


METHOD 


Since both the tests of judging ability and the 
prediction instruments were devised and fully re- 
ported by Cline (1955), the following is an abbrevi- 
ated description of these procedures 


Tests of Judging Ability 

The basic procedure used was a sound movie 
of four fictitious employment interviews. The inter- 
viewees were male undergraduates. Three of them 
were 19 years old and single. The fourth was 33 
years old, married, childless, and a veteran. Each 
observer or judge was asked, after viewing each one 
of the four interviews, to make a series of predic- 
tions about the interviewee (hereafter called social 
object) on three specially devised instruments, only 
one of which was used in the present analysis. 

The four filmed interviews, conducted by a trained 
actor, were highly structured and fairly constant 
These were chosen from a group of nine films in 
order to obtain the most diverse personalities among 
the social objects. Cline has stated that the inter- 
views are divided into three phases: a standard job 
interview situation, a stress situation in which the 
interviewer is highly critical of the interviewee, and 
a relaxed after-interview abreaction session in which 
the interviewee’s reactions to the interview are dis- 
cussed. The intent was to get a dynamic record of a 
series of subjects on a sound film, rich and varied 
enough in the use of verbal and visual cues so that 
a judge would have an adequate sample of each 
social object’s social technique. 


Prediction Task 


The sole prediction task used in the present study 
to assess the judging ability of persons who observed 
the films was the Personality Word Card (PWC). 
Here the judge was asked to predict the responses 
of each of the four social objects on a 100-word 
adjective check-list. This list was derived from a 
300-word Adjective Check-List compiled by Gough.* 


8 Gough, H. G. Predicting success in graduate 
training. (Progress report) Unpublished manuscript, 
University of California Institute of Personality As- 
sessment and Research, 1952. 





The corrected split-half reliability for the PWC as 
a test of judging ability for a group of 100 college 
undergraduates was 83 + .06. 

The PWC was also used by both the judges and 
social objects to describe themselves. Comparisons 
were, therefore, readily made among the judges’ 
prediction of the social object, the social objects’ 
self-reports, and the judges’ self-reports 

Two other prediction tasks were given to the 
judges, although they were not used in the present 
analysis. One of them, the Behavioral Postdiction 
Test, in which the judge was required to predict 
the social object’s behavior in real life, was discarded 
because it was discovered, when a simple analysis 
of variance was carried out, that the source of 
variation attributable to the judges was not signifi- 
cant. A third instrument, a multiple-choice sentence 
completion test, was also discarded as a test of 
judging ability, because Cline had found that the 
corrected split-half reliability for a group of 10% 
college undergraduates was only 36 + .06 


Judges 


One hundred Air Force Captains were tested at 
the Institute of Personality Assessment and Re- 
search, as part of a large scale study of officer 
effectiveness. They were tested in groups of 10 in 
a living-in situation similar to that employed by the 
OSS during World War II (Office of Strategic Serv- 
ices, 1948). Their average age was 33.6 with a range 
of 27 to 49. Ninety-five percent of the officers were 
married, with an average of two children. Fifty-six 
had attended college, seven had done some graduate 
work, and only three had not graduated from high 
school 


Measures 


Accuracy. This variable is defined as the tetra 
choric correlation between the social objects’ self- 
reparts and the judges’ prediction of them on the 
PWC. Since there are four social objects, each one 
of the 100 judges has four accuracy scores 

Similarity. This measure is defined as the tetra- 
choric correlation between the social objects’ self- 
reports and the judges’ self-reports on the PWC 
Here too, each one of the 100 judges has a similarity 
score for each of the four social objects 

Self-Acceptance (self-ideal-self correlation). Each 
judge completed the PWC twice, once describing 
himself as he was and then a second time describing 
himself as he would ideally like to be. The measure 
of association used was the phi coefficient. It is 
assumed that the greater the degree of association 
the more self-accepting and therefore the “healthier” 
the individual 

Soundness. This variable was drawn from a pool 
of 30 traits which were used by 10 staff members 
in rating the judges (officers). The following pro- 
cedure was used. After the first four groups of 1( 
officers had been run, 10 staff members of the 
Institute of Personality Assessment and Research 
rated each of the 40 officers in the 30 traits. A 
normal distribution was required of the raters, using 
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Similarity 
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soundness 
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a five-point scale. The same procedure was followed 
in rating the remaining 60 judges, after they had 
all been assessed. A composite trait rating 
derived from each assessee by combining the 
vidual ratings of the 10 staff members. Each 
of the trait pool was then assigned a scale 
for each assessee. 
Soundness was defined as 


was 
indi- 

item 
value 


self-insight and 
and under- 
Absence of serious emotional 
and manner. Good 
balance of social conformity and spontaneity. 


Maturity in personal relations; 
self-acceptance, as well 


as acceptance 
standing of others f 


problems. Stability of 


mood 


All of the 
simply 


analysi 


above m«¢ 


obtained, are 
statistical 


asures, 


treated as scores, for purposes 


once 


RESULTS 

The first hypothesis—that accuracy of 
prediction of personality traits of others is 
dependent on the degree of similarity of judge 
ind social object—was tested by correlating 
imilarity and accuracy for each of the social 
Table 1 
relation between similarity and accuracy for 
Ol Je ts 


objects. As can be seen ir the cor- 
Social Object I is .01 and for So 
III and IV is .00. The cor: 
Object II, however, is .28, 
O1 level of confidence. Since this is the only 
one of the four social objects where signifi- 
cance was achieved, the first hypothesis, that 
there is a relationship between accuracy of 
prediction and similarity of judge and social 
object, was not substantiated. 

The following results are concerned with 
the relationship of mental health and the 
ability to predict personality characteristics 
of others. As can be seen in Table 1 there is 
no significant relationship between the ability 


elation for Social 


significant at the 


to predict personality characteristics of others 
and self-acceptance (self 
tion) or soundness of the judge, and, there- 
fore, both Hypotheses II and III have not 
been substantiated. 

Although it has been shown that similarity 
of personality characteristics of judge and 
social object is not related to accuracy of 
prediction on the part of the judge, other 
factors might prevent a simple expression of 
this relationship. For example, a judge who 
is similar to a social object may predict ac- 
curately only if he self-accepting (i.e., his 
self—ideal-self correlation is high). With this 
in mind, a triple classification analysis of 
variance for judging ability was carried out. 
The judges were divided into a high and low 
similarity group as well as a high and low 
self-accept ince group (self—ide 
tion), for 
Judges above the 


and 


ideal-self correla 


il-self corre la- 
each of the four social objects. 
median were considered 
below the median 

low on both similarity and self- 


higk those were 
considered 
acceptance. 
rying number of observa- 


tions in the cells of the 


Because of 
triple classification, 
pool the residual 
s of squares. This is a 
estimate of the error variance; 


it was necessary to and 
triple interaction sur 
conservative 
if the triple interaction is 


it with the error \ 


significant, pooling 
nce would only produce 


term and tend to 


verestimate 
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: rABI 3 
\CCURACY AND “FAVORABLI ADJEC 
FOR Each oF THE SocraAr O1 I 
Pe , 
Loug 
Fa ’ 
soca 
Obj 
Socia Mear I} 
Objet Accuracy Self 
I 46.24 $2 
II 23.96 4) 
Il] 33.51 31 
IV 64.51 48 


The analysis of variance shown in Table 2 
reveals two significant mean squares: social 
objects, and the interaction between simi- 
larity and self-acceptance. In attempting to 
understand the basis for the large mean 
square for social objects, it was noted that 
the self-descriptions of the social objects 
tended to be quite favorably toned. A measure 
of this favorable tone derived from 
Gough’s list of “favorable” adjectives. Gough 
(see Footnote 3) was able to derive the list 
by asking a group of 30 college students to 
rate the entire list of adjectives for its favor- 
ability. The highest rated 25° were selected 
for the favorability key. One question which 
immediately comes to mind is whether there 
is a relationship between accuracy of predic- 
tion, on the part of the judge, and favorable- 
ness of self-perception, on the part of the 
social object. If one compares the list of 
favorable adjectives with the mean accuracy 
score for each of the social objects, in Table 3, 
one notes a perfect correlation (rho) between 
the two variables. Thus, the accuracy of 
prediction appears to depend, in part, on 
favorableness of self-perception of the social 
object. 


was 


The second significant mean square noted 
in Table 2 is the interaction between simi- 
larity and self-acceptance. To further clarify 
the relationship, the accuracy scores of judges 
conforming to the four possible combinations 
of similarity and self-acceptance, under the 


present conditions, are shown in Table 4. 


Two groups can 


be considered relatively 


accurate: judges similar to a social object and 


self-accepting, and 


social object and 


dissimilar to a 


The 


judges 


not self-accepting. 


other two groups can be considered relatively 
inaccurate: judges similar to a social object 


and not self-accepting, and judges dissimilar 


‘ 


DISCUSSION 


What are 
statement 


the 
that 


implications 
one judges others only in 


to a social object and self-accepting. 


AND CONCLUSIONS 


of Sullivan’s 


terms of one’s self? It obviously creates a 


huge solipsism in 
drawn 
judgment process. 
exhibited between 


social object and 


between one’ 


which no distinction is 
and others in the 
The lack of relationship 
similarity of judge and 


accuracy of prediction 


ity 
S SP il 


(Hypothesis I), in the present experiment, 
suggests that the prediction of personality 
characteristics of others is not a reflection of 
the judge, a simple attribution to another of 
the traits of the judge. 


In addition, no relationship was found, in 
the present study, between the mental health 


of an 
criterion, 


relation): and 


individual—using 
self-acceptance 


(b) 


self-rating 
ideal-self cor- 
external criterion, 


(aq) a 


(self 
an 


judged soundness of an individual by expert 
raters—and his ability to predict personality 
characteristics of others (Hypotheses II and 


IIT). 
When, 


acceptance { self 


however, 


ideal-self correlation) 


both similarity and self- 
inter- 


act, there is a significant effect on accuracy 


This 


some 


result can 
assumptions 


be 


we take 
psycho- 


understood if 
concerning the 


logical meaning of each of these variables. Let 


TABLE 4 

Mz \ RAC F THE INTERACTION ON Hic! D 

| Scores ON Botn SIMILarity 

» Se_tr—-ACCEPTANCI 

Grou N Mear SD 
High Sim and High SA 120 46.18 20.07 
High Sim and Low SA 80 39.08 22.40 
Low Sim and High SA 80 37.49 21.23 
Low S ind Low SA 120 $2.97 21.93 

Note High =Judges ab is SA 


judges below mediar 
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us assume that a judge who is relatively self- 
accepting will tend to judge others as similar 
to himself. The unconscious basis for his 
predictions might be as follows: “I am a nice 
person and so are other people.” A judge who 
is relatively low on self-acceptance, however, 
will tend to judge others as dissimilar to 
himself. Here, the unconscious basis for his 
predictions might be: “I am not a nice 
person, but others are.” The mechanisms 
described here are similar to those character- 
ized by Cameron and Magaret (1951) as, 
respectively, assimilative and disowning pro- 
jection. 

As was described above, there are objective 
measures of the similarities of judge and 
social object. If we focus our attention on 
thuse judges who are high on self-acceptance, 
we can hypothesize that they would be ac- 
curate in the prediction of personality char- 
acteristics of those social objects similar to 
themselves, and inaccurate in the prediction 
of personality characteristics of those social 
objects dissimilar to themselves. This would 
logically follow since, as postulated above, a 
self-accepting judge would tend to perceive 
others as similar to himself. Those actually 
similar to him would, therefore, be judged 
accurately, while those dissimilar to himself 
would be judged inaccurately. 

In a similar fashion, we would predict that 
judges with low self-acceptance scores would 
accurately perceive those social objects who 
are dissimilar to themselves, while inaccu- 
rately perceiving those social objects similar 
to themselves. This, too, would logically 
follow since, as postulated above, judges with 
low self-acceptance scores would tend to per- 
ceive others as dissimilar to themselves. 
Those social objects actually similar to these 
judges would, therefore, be perceived inaccu- 
rately, while those dissimilar to them would 
be perceived accurately. 

The post hoc explanation invoked here 
conforms to the results in Table 4. While the 
results presented suggest that judging the 
personalities of others involves more than an 
attribution of one’s traits to another, it also 
suggests that judges engage in a mechanical 
type of prediction based on their self-concept. 
If the judges are self-accepting, they judge 
others in a similar manner. If the judges are 


Spanner 


not self-accepting, they judge others in a 
contrary manner. Accuracy, therefore, de- 
pends not only on whether an individual is 
self-accepting or not, but also on whether 
the judges are similar or dissimilar to the 
social object. Bronfenbrenner (1958) has 
made a similar point. He utilized the variable 
“Favorability toward others” in his analysis, 
however, rather than self-acceptance used in 
the present study. 

Looked at from this point of view, the 
ability to predict personality characteristics 
of others should not be perceived as a trait 
residing within the judge, but, rather, as 
based on the personality patterns of both 
judge and social object. This point of view 
is further substantiated by an analysis of 
variance of judging ability carried out by 
the author (Spanner, 1955), in which the 
interaction between judge and social object 
approached significance at the .05 level. 

This approach suggests, therefore, that the 
question to be asked will no longer exclusively 
be “Is he a good judge of personality?” but 
also “Which social objects can he judge 
well?” Another facet of the same problem 
would be an attempt to break down the 
evaluation of personality into various areas 
and to assess the ability of judges in each 
area. This presupposes a representative de- 
sign of the sort suggested by Brunswik 
(1947), in which there would be not only 
representative sampling of subjects (judges), 
but also of objects (social objects). In the 
present instance a preliminary attempt was 
made to analyze the different skills involved 
in judging social objects with favorable as 
compared to unfavorable self-perceptions. 
The attempt was abandoned, however, be- 
cause of the small number of social objects 
(four). 

Possibly if some of the aforementioned 
approaches are handled successfully, eventu- 
ally, we will then be in a position to answer 
the following question: “How does the ability 
of a judge to predict the personality charac- 
teristics of others relate to his own person- 
ality dynamics?” This, of course, is related 
to the entire area of ego psychology and the 
differential utilization of ego defenses by an 
individual in judging the personality charac- 
teristics of others. 





Prediction of Personality Characteristics 


SUMMARY 


One hundred military officers were pre- 
sented with sound movies of stress interviews 
with four interviewees (social objects) and 
asked to predict the responses of the social 
objects on an adjective check-list, as well as 
to describe themselves on the same instru- 
ment. From these basic data measures of 
accuracy of prediction and similarity of judge 
and social object were obtained. Measures 
of the self-acceptance (self—ideal-self correla- 
tion) and “soundness” of the judges were 
also obtained. 

Contrary was 


to expectations, accuracy 


found to be unrelated to similarity of judge 
and social object, to self-acceptance, and to 
the rated soundness of the judge. Accuracy 


scores were related, however, to a combina- 
tion of similarity and self-acceptance vari- 
ables. The concepts of disowning and assimi- 
lative projection were utilized as a post hoc 
explanation of accuracy. 
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HOSTILITY EXPRESSION AMONG DELINQUENTS 
OF MINORITY AND MAJORITY GROUPS 


DON L. SWICKARD anp BERNARD SPILKA 


University of Denver 


The frustration-aggression hypothesis has 
frequently provided a theoretical framework 
for explaining the phenomena of crime, delin- 
quency (Dollard, Doob, Miller, Mowrer, 
Sears, Ford, Hovland, & Sollenberger, 1939), 
and prejudice (Allport, 1954; Dollard et al., 
1939). Two prevalent and pervasive sources 
of frustration which are seen as motivating 
anti-social behavior are low socioeconomic 
status and minority group membership 
(Dollard et al., 1939). Evidence to support 
this position may be adduced from the work 
of Glueck and Glueck (1950) and Suther- 
land and Cressy (1955), who show clearly 
the relationship between poverty and delin- 
quency. On the other hand, Hammer (1953), 
McCary (1950), and Mussen (1953) have 
demonstrated that minority group members 
reveal more manifest aggression than do 
members of majority groups. The factors of 
poverty and discrimination have been con- 
founded in these studies, thus failing to 
clarify the role of each. It may be hypothe- 
sized, however, that minority group member- 
ship and low socioeconomic status combine 
to effect a greater manifestation of hostility 
than would simply result from the frustra- 
tions of poverty alone. Delinquents who are 
members of a minority might, therefore, be 
expected to reveal more evidences of hostility 
than delinquents who are majority group 
members. The present investigation was 
designed to assess this position, utilizing 
Spanish-American and non-Spanish, white 
delinquents. That the Spanish-American 
group is a negatively valued minority has 
been pointed out by Jones (1948) and 
Saunders (1954) in their writings concerning 
the Spanish- or Mexican-American, and his 


status among Anglo-Americans. Specifically, 
the hypotheses tested were: 


1. A group of Spanish-American delin- 
quents of low socioeconomic status will show 
significantly greater evidences of aggression, 
in both amount and kind, than will be demon- 
strated by a similar group of non-Spanish, 
white delinquents. 

2. Because of tendencies on the part of 
delinquents to give a “good impression” on 
personality measures (Lindzey & Goldwyn, 
1954; Vane, 1954), it will be necessary to 
correct hostility measures for social desir- 
ability. 

3. Social desirability will relate negatively 
to manifest and extrapunitive measures of 
hostility and positively to intropunitive and 
impunitive measures of hostility. 


METHOD 


Subjects. The original group studied consisted of 
81 male and female delinquents on probation at 
the Denver Juvenile Court. The subjects were 
selected in such a way as to eliminate all who were 
the result of mixed group backgrounds, who had 
been diagnosed as neurotic or psychotic, were other 
than lower class, or who had been institutionalized 
in the last 2 years. On the basis of obtaining scores 
of 10 or more on the MMPI Lie scale, 7 subjects 
were eliminated. The final group thus consisted of 
25 Spanish-American males, 12 Spanish-American 
females, 25 non-Spanish white males, and 12 non- 
Spanish white females. All subjects 
14 and 17 years of age. 

Tests. The tests administered were the Rosenzweig 
Picture Frustration Study (Rosenzweig, Fleming, & 
Clarke, 1947), the Siegel Manifest Hostility scale 
(Siegel, 1956), the 39-item Social Desirability scale 
extracted from the MMPI by Edwards (1957), and 
the MMPI Lie scale (Hathaway & McKinley, 1951) 

Procedure. All subjects were tested in groups of 
from 5 to 15, in the courtroom of the Denver 
Juvenile Court. All subjects were assured that the 
testing had nothing to do with their probation and 


were between 
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that the results would not be made available to their 
probation counselors. All subjects were cooperative 
and answered all questions without difficulty. 


RESULTS AND DISCUSSION 


Table 1 contains the means and standard 
deviations for the four groups on the various 
tests administered. Pearson product-moment 
correlations were computed between the 
Social Desirability scale and the other meas- 
ures, as seen in Table 2. Hypotheses 2 and 3 
obtained support as the Social Desirability 
scale correlated —.655 with the Siegel Mani- 
fest Hostility scale, —.372 with the extra- 
punitive scores, .222 with the intrapunitive 


and .262 with the impunitive scores. 
The first two coefficients are significant at 
the .01 level of confidence, and the last two 
at the .05 level, with 72 degrees of freedom. 

One-way simple classification analyses of 


scores, 


among Delinquents 


variance and covariance were computed be- 
tween the four groups: Spanish-American 
males and females, and non-Spanish white 
males and females. This ‘analysis was more 
conservative than a group X sexes design 
would have yielded. The latter was computed 
in the analysis of variance for a 2 X 2 dis- 
proportionate subclass numbers design as 
recommended by Snedecor (1956), in order 
to check for further significance than might 
have been obtained via the single variable 
design. No additional significance was ob- 
tained for the group, sex, or interaction terms. 
Because of the problem of interpreting an 
analysis of covariance for the 2 x 2 design 
with unequal and disproportionate subclass 
numbers, it was decided to employ the more 
conservative analysis reported here. 

The analyses of variance across groups 
were significant at the .05 level for social 


TABLE 1 


MEANS AND STANDARD DEVIATIONS OF AGE, SOCIAL DeEsIRABILITY, AND MANtiFEST HOsTILITY 


AND OF EXTRAPUNITIVE, INTROPUNITIVE, 


AND IMPUNITIVE SCORES FOR ALL GROUPS 


Group” 


Variable* 


Age 
Mean 
SD 


SD 


M 


Mean 
SD 


* SD refers to So 
Impunitive Scores. 
SA-M refers to Spanish- Americ 
non-Spanish, } 


ial Desir 


ability scaie 


SA-F, S; 


white females 


MH, Manifest Hostility scale: E, 


vanish-Amer 


NSW-M 
(N = 25 


Extrapunitive Scores; I, Intropunitiv 


ican females; NSW-M 


non-Opanish, white t 
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TABLE 2 


CORRELATIONS BETWEEN 
THE EXTRAPUNITIVE, INTRAPUNITIVE, 


THE SociAL DesrRABILITY SCALE 
AND IMPUNITIVE 


AND THE MANIFEST Ho 


SCORES ON THE | 


TILITY SCALE, AND 


.OSENZWEIG 


PICTURE-FRUSTRATION STUDY 


SA-M 
Variable (N = 25 
SD and MH 
SD and E 
SD and I 
SD and M 


— .648** 

—.110 

— .136 
.104 


.627* 
581 
421 
588* 


data. All subgroup coefficients in the first 
is at best a conservative estin 
* Significant at .05 level 
** Significant at .01 level. 


three rows are 


ate of this relationsh 


desirability (see Table 3), and not significant 
for the Manifest Hostility scale (see Table 
3). No significance was obtained for the 
analyses of extrapunitiveness, intropunitive- 
ness, and impunitiveness (see Table 3). Be- 
cause of the significant differences among the 
groups in social desirability and the correla- 
tions noted above, analyses of covariance 
were computed among the groups on the 
hostility scores, while adjusting the means for 
social desirability. Significance was attained 
for the Manifest Hostility scores, but not for 
any of the other measures (see Table 3). 
The adjusted means computed for the Mani- 
fest Hostility scale are: Spanish-American 
males, 24.3; Spanish-American females, 19.8; 
non-Spanish white males, 19.3; 
Spanish white females, 19.7. 

It is apparent that Hypothesis I is partially 
supported since the Spanish-American male 


and non- 


TABLE 3 

F RATIOS BETWEEN GROUPS FOR ORIGINAL MEANS 
AND MEANS ADJUSTED FOR SOCIAL DESIRABILITY 
Original 
Means 

(df = 3.70 


Adjusted 
Means 


Variable df = 3.69 


SD 

MH 
E <1 
I <1 


3.327* 
1.011 


M 


<1 


* Significant at .05 level 


® The r's for the total group are not based on an averaging of the s 


Group 


NSW-M 
N = 25 


706** 
466 
288 


77 )** 


ibzro 


ogenou 


group obviously manifested more hostility on 
the Manifest Hostility scale than did any of 
the other groups. 

It is not clear why the Spanish-American 
females did not reveal more evidences of hos- 
tility than did either of the non-Spanish white 
groups. It is noteworthy that the significance 
obtained among the groups on the Social 
Desirability scale is probably, in part, due 
to the fact that the Spanish-American females 
received the lowest mean this 
variable. 

Jones 
patterns 


score on 
(1948) in a study of the ethnic 
of the Mexican family in the United 
States indicates that the home training given 
girls in this group is radically different from 
that given boys. Among other things mani- 
festations of hostility are strongly disap- 
proved. The boys of this group might thus 
be more similar to the non-Spanish white 
group in background, and it is possible that 
the Social Desirability scale thus less 
appropriate for the Spanish-American female 
than for the male. This somewhat different 
conception of social desirability plus pres- 
sures against evidencing overt aggression 
would account partially for the low Social 
Desirability scale obtained by the 
girls, and failure of the adjusted Manifest 
Hostility scores to increase along with those 
of the Spanish-American males. 


1S 


scores 


The failure of the Rosenzweig Picture- 
Frustration Study to yield significance has 
been noted before (Lindzey & Goldwyn, 1954; 
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Vane, 1954), and it may be that the P-F 
study is actually not measuring manifest hos- 
tility but rather aggression on a different level 
of the personality than that which might be 
significant for the present study. The P-F 
study does, however, seem sensitive to the 
variable of social desirability, as is the Siegel 
Manifest Hostility scale, and it is suggested 
that future research involving these two in- 
struments take this into consideration, as was 
done here. 

Another meaningful consideration may be 
based on the possibility that the lower-class 
culture simply supports the expression of hos- 
tility without it necessarily being related to 
frustration. The assumed frustration-aggres- 
sion relationship may therefore be inappro- 
priate. This would account for the differences 
observed on the Manifest Hostility scale and 
the failure of the P-F study to reveal the hy- 
pothesized tendencies since the latter measure 
assumes the significance of frustration 

In brief, the results of the present investi- 
gation indirectly lend support to the frustra- 
tion-aggression hypotheses by suggesting that 
the combination of frustration 
socioeconomic and 


low 
group 


from 


status minority 


membership may increase the expression of 


aggression that which is observed 
when only low socioeconomic conditions are 


present. 


over 


SUMMARY AND CONCLUSIONS 


The present study was designed to investi- 
gate the relation of hostility to a combination 
of low socioeconomic status and minority 
group membership. The frustration-aggression 
hypothesis was employed as a_ theoretical 
referent, and it was hypothesized that the 
increased frustration of minority group mem- 
bership in addition to low socioeconomic 
status, would produce more manifestations of 
hostility than would be observed in majority 
group members of similar class levels. It was 
further hypothesized that tendencies to give 
a “good impression” would relate negatively 
to manifest hostility and extrapunitive tend- 
encies while relating positively to  intro- 
punitiveness and impunitive expressions. On 
the basis of the latter considerations, it was 
felt that obtained hostility scores would have 
to be adjusted to minimize such biases. 
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Eighty-one Spanish-American and _ non- 
Spanish white delinquents on _ probation 
served as subjects. All subjects were adminis- 
tered the Siegel Manifest Hostility scale, the 
Social Desirability scale derived by Edwards, 
the Rosenzweig Picture-Frustration Study, 
and the Lie scale from the MMPI. Seven 
subjects who obtained scores of 10 or more 
on the Lie scale were eliminated from the 
study. Significant negative correlations were 
found between the Social Desirability scale 
and the Siegel Manifest Hostility scale, and 
the extrapunitive scores from the Rosenzweig 
Picture-Frustration Study. Significant 
tive correlations were obtained between the 
Social Desirability scale and the measures of 
intropunitiveness and impunitiveness. Once 
the hostility means were adjusted to remove 
the effects of social desirability, significance 
obtained between the groups on the 
Manifest Hostility scale. The Spanish-Amer- 
ican male group was shown to manifest sig- 
nificantly greater hostility on this measure 
than any other group, thus partially support- 
ing the main hypothesis. 


posi- 


was 
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THE USE OF OPPOSITE SEX SCALES AS A MEASURE 
OF PSYCHOSEXUAL DEVIANCY'’ 
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anD E. MORGAN 


VU 


Most self-report tests of personality obtain 
an estimate of male and female sex role 
identification through the use of like-sex 
scales. Sex role identification or the lack of it 
is typically evaluated by the subject’s re- 
sponses to a scale of items constructed and 
validated on his own sex (Hathaway & 
McKinley, 1943; Strong, 1943). Thus, a 


person obtaining an average or high score on 
his own sex scale is said to be identified with 
his own sex, and a person obtaining a low 
score on his own sex scale is said to lack 
identification with his own sex, or to be more 
like the opposite sex in his responses. In an 


earlier paper the authors described the deri- 
vation of a scale for boys and girls between 
the ages of 8 and 12 years in which each 
subject received a score on an opposite-sex 
scale as well as on a like-sex scale (Rosenberg 
& Sutton-Smith, 1959). The existence of 
these scales for measuring masculinity and 
femininity has made it possible to investigate 
the relative effectiveness of both the like-sex 
scale and the opposite-sex scale for discrimi- 
nating sex role identification. 

It is generally assumed that individuals 
who are faulty or aberrant in their sex role 
identification will be more emotionally dis- 
turbed than those who are not (Henry, 1948; 
Terman & Miles, 1936). Therefore, they 
should tend to have less satisfactory scores on 
measures of emotional stability. This paper 
examines the relationship between various 
types of scores on a masculinity and a femi- 
ninity scale and several independent measures 
of emotional stability. The independent 

1 This study 
Scholarly 
State 


was facilitated by a grant 
Advancement Committee, 
University. 


from the 
Bowling Green 


fonroe County School 


measures of emotional stability make it pos- 
sible to explore not only the existence of 
sex role confusion, but also certain of its 
qualitative aspects. 


METHOD 


A group of 337 children in the fourth, fifth, and 
sixth grades in two elementary schools in Northwest 
Ohio and Southeast Michigan comprised the sample.? 
Several instruments were administered to the entire 
group: (a) a check list of games and play activities 
which is described elsewhere (Rosenberg & Sutton- 
Smith, 1959) and which effectively yields measures 
of masculinity and femininity, (6) an empirically 
derived scale which has been found to measure ex- 
tremes of impulsive behavior (Sutton-Smith & Rosen- 
berg, 1959), (c) the Children’s Manifest Anxiety 
scale which affords a measure of anxiety (Castaneda, 
McCandless, & Palermo, 1956), and finally (d) the 
Brown Personality Inventory (Brown, 1934). The 
latter scale, derived in 1934, has been found to dis 
criminate neuroticism in children, and to be highly 
correlated with the CMA scale (Rosenberg, Sutton 
Smith, & Morgan, in press). Its addition provides 
subscores which reveal the source of major concern 
for neurotic children (home, school, physical, inse- 
curity, and irritability). The “home” subscale in- 
cludes feelings of parental rejection, parental severity, 
and sibling jealousy. The “school” subscale identifies 
feelings of inadequacy in the classroom. The “physi- 
cal” subscale deals with disturbances in physical well- 
being which appear to be psychogenic in origin. The 
“insecurity” subscale contains items reflecting anx- 
iety, interpersonal inadequacy, and generalized neu- 
rotic concerns. The “irritability” subscale examines 
emotional reactivity, low tension-binding qualities, 
and severe feelings of rejection (Brown, 1934) 

An analysis was conducted of the scores on im- 
pulsiveness, anxiety, and neuroticism of boys and 
girls in the upper and lower quartiles on the 
masculinity and femininity scales 


2 The authors wish to express their indebtedness 
to H. Lehtomaa, R. Knestrict, and the teachers of 
Dundee, Michigan, schools for their assistance in the 
collection of the data 
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TABLE 1 


Neurotic INDICES OF Boys 


Group Imp Anx BPI 
San 16.83 22.36 
3.61 7.98 13.63 


48 48 45 


High 
Boys 


8.26 17.48 24.81 
3.60 8.21 12.56 
44 46 43 


Low 
Be ys 


8.09*** 
4.08 


21.09* 
7.86 
45 


High 


Girls 


Low 
Girls 


* Significant at t 
** Significant at the 


ek Ci 


ignificant at the 


RESULTS 


The performances on the neurotic indices 
of boys and girls high and low on masculinity 
are presented in Table 1. The slight variation 
in the size of the Ns is due to incomplete 
protocol on various tests. The results of 
Table 1 suggest that scoring high or low on 


the masculinity scale is reflected very little 
in independent measures of neurotic behavior 
for boys. The general tendency is for boys 
low on masculinity to appear more anxious 


AND Grrits HiGH AND LOW ON THE MASCULINITY 


SCALE 


and neurotic, but the differences between 
them and the high scorers fail to achieve 
statistical significance. In the case of the 
girls, the results are more substantial. Girls 
high on masculinity tend to be more im- 
pulsive and anxious than girls low on mascu- 
linity, and show more neuroticism about their 
home life (parental rejection, etc.). 

The performances on the neurotic indices 
of boys and girls high and low on femininity 
are presented in Table 2. The results indicate 


TABLE 2 


Neurotic Inpices or Boys 


Group Imp Anx BPI 
10.33** 18.26** 26.00** 
3.57 7.86 13.04 

43 43 38 


High 
Boys 


8.52 14.31 20.73 
6.67 9.99 


42 41 


Low 
Boys 
] 19.86 25.2: 
SD ; 7.13 13.4 
N 1 43 40 


High 


Girls 


M 20.73 2 
SD 6.79 1 
N 48 48 4 


Low 
Girls 


* Significant at the .10 level, or less. 
** Significant at the .05 level, or less. 
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TABLE 3 


Neurotic INnpices or Boys 
IN THE 


AND Grris HiGcH 


Group Imp Anx BPI 


9.02 
2.96 


16.90 24.30 
831 11.36 
21 20 


High 
Boys 


21.40 
9.06 
18 


29.33 
13.45 
18 


Low 
Boys 


20.60 
9 36 


14 


21.71 
7.91 
14 


High 


Girls 


17.78 21.92 
SD 8.08 11.46 
N 27 2 


“ 


Low 


Girls 


that boys who are high on femininity are 
more impulsive, anxious, and neurotic than 
boys who are low on femininity, and that 
their neurotic concerns focus upon physical 
problems, feelings of insecurity, and irrita- 
bility. From Table 2 it can be seen that girls 
high and low on femininity do not differ sig- 
nificantly in their performances the 
neurotic indices. 

In order to examine the possibility that 
it was not the variation in masculine or 
feminine identification but response-set which 
accounted for some of the differences ob- 
tained, a further analysis was undertaken. 
This analysis was similar to that described 
above, except that in order to decrease the 
possibility of response-set being of impor- 
tance, boys and girls were chosen who were 
high and low on each scale, but were within 
the middle range on the other scale (i.e., their 
scores on the opposite-sex scale were between 
the 25 and 75 percentiles). Thus, one of the 
groups of boys and girls was of those who had 
high or low masculinity scores (upper and 
lower quartiles), but were within the middle 
range on femininity scores. Table 3 presents 
the results on the neurotic indices of boys and 
girls high and low on masculinity and middle 
range on femininity. Though none of the 
means achieve statistical significance, boys 
low on masculinity and middle on femininity 
tend to be more anxious and neurotic than 


on 


MippLe RANGE ON THE FEMININITY 


AND LOW ON THE MASCULINITY 
SCALE 


AND 


Ho Sch Phy 
3.00 
2.62 


1.50 
1.29 
20 


6.15 
4.52 
20 


8.39 
5.33 
i8 


6.38 
458 


24 


boys high on masculinity and middle on femi- 
ninity, with the focus of concerns on the 
physical and inferiority subscales of the 
Brown Personality Inventory. For the girls, 
those high on masculinity and middle on 
femininity tend to be more impulsive and 
anxious, with concerns about 
symptoms. 

Table 4 presents the results on the neurotic 
indices of boys and girls high and low on 
femininity and middle range on masculinity. 
There is only one instance in which signifi- 
cant differences are found, but boys high on 
femininity and middle range on masculinity 
tend to be more anxious and neurotic, with 
the focus of concerns about physical, inferi- 
ority, and irritability feelings. For girls, those 
low on femininity and middle on masculinity 
tend to score higher on measures of anxiety 
and neuroticism, with concerns surrounding 
the home. Apparently, both scales are of some 
use in the evaluation of sex role identification, 
though this confusion is reflected most clearly 
by the opposite sex scale. It is noteworthy 
that though they do not achieve statistical 
significance (possibly because of the de- 
creased size of the N), a number of the 
differences between the highs and lows in this 
latter analysis are of some magnitude, and 
are in the same direction as the scores when 
response-set is not controlled. 


physical 
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DISCUSSION 


The results of the present study show that 
the opposite-sex scale has a higher relation- 
ship to various measures of emotional sta- 
bility than does the like-sex scale. This 
suggests that the opposite-sex scales are more 
effective in the discrimination of faulty sex 
role identification. The like-sex scales, though 
discriminating between high and low scorers 
of the same sex, were not equally effective 
in detecting faultiness of sex role identifica- 
tion as reflected in these independent meas- 
ures of emotional stability. Apparently it 
would be most economical to use the opposite- 
sex scale in diagnosing sex role identification. 

The measures of masculinity and femi- 
ninity used in this study required children 
to indicate whether they played or did not 
play certain games. In addition, children were 
required to indicate whether they liked or 
disliked the games that they played. The fact 
that the responses of both boys and girls to 
play and game items of their own sex were 
not significantly related to measures of emo- 
tional stability suggests that the role expecta- 
tions for each sex are fairly explicit in this 
area of like-sex behavior. Apparently, boys 
are clear about what boys are supposed to 
do as boys, and girls are clear about what 
girls are supposed to do as girls, so that 
irrespective of sex role confusion, both sexes 
may make enough conventionally expected 


responses on the same sex scale for it not 
to be useful as an indicator of sex role 
confusion. On the other hand, the greater 
effectiveness of the opposite-sex scale suggests 
that role expectations of this sort may be 
more ambiguous. That is to say, chi'dren are 
less certain about how much interest they 
should show in the activities of the opposite 
sex. 

Examination of the qualitative differences 
between boys and girls who are high scorers 
on the opposite-sex scale and the independent 
measures of emotional stability shows that 
there are important sex differences in the 
type of sex role confusion yielded by this 
scale. For example, the boys who are high 
scorers on the femininity scale appear to 
show considerably more confusion than the 
girls who are high scorers on the masculinity 
scale. In previous papers, the authors have 
shown that femininity in boys is associated 
with high scores both on anxiety and im- 
pulsiveness (Sutton-Smith & Rosenberg, in 
press-a, in press-b). Those findings are con- 
firmed by the present study. In addition, 
the fact that the anxiety is expressed mainly 
on the subscales insecurity, irritability, and 
physical problems suggests that these boys 
have a very uncertain “self” picture. On the 
one hand they are anxious about their per- 
sonal and physical selves; on the other, they 
are prepared to indulge in impulsive acting 


TABLE 4 


Neurotic Inpices oF Boys AND Grris HIGH 


aND Low ON THE FEMIN 


IN THE MippLE RANGE ON THE MASCULINITY SCALI 


Group Imp Anx BPI 


10.13 24.81 
3.11 13.96 
15 . 16 


High 
Boys 


M 9.50 20.52 
SD 3.42 901 
N 22 21 


Low 
Boys 


M 7.20 25.00 
SD 7 12.38 
N 19 


High 
Girls 


M 27.19 
SD 3.25 10.48 
+ se 


Low 
Girls 


** Significant at the .0S level, or less. 
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3.44 

3.14 
16 


19 
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3.22 
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out behavior (“I like to throw snowballs,” 
“I like to chase fire engines”); and in addi- 
tion, they show a greater than normal prefer- 
ence for feminine play activities and games. 
While this empirically derived picture is a 
paradoxical one insofar as impulsiveness and 
femininity would appear superficially to be 
contraries, it is similar to a finding of Hartley 
(1959) based on interview materials that 
there are some boys who choose such a 
feminine-impulsiveness as a defense. There is 
some agreement then that at least one im- 
portant type of sex role confusion in children 
of this age group is to be found manifest in 
this feminine-impulsive syndrome. While the 
dynamics lying behind this syndrome are not 
at present apparent, the authors favor the 
view that impulsiveness is a means of warding 
off anxiety about sex role deviancy (abnormal 
identification with feminine role preferences) 
through pseudomasculine acting out behavior 
(Sutton-Smith & Rosenberg, in press-a). 
The highly masculine girls do not show 
quite the same inconsistencies in their various 
scores. Their high scores on the impulsiveness 
scale (on which males have higher average 
scores than females) are consistent with their 
high scores on the masculinity scale. Not only 
are the girls’ choice patterns consistent, they 
are also more culturally acceptable than the 
choice patterns of boys scoring high on fem- 
ininity. It ts not unusual for girls to choose 
masculine type activities in Western culture 
(Brown, 1958); in fact it is increasingly suit- 
able for them to do so (Rosenberg & Sutton- 
Smith, 1960; Sutton-Smith & Rosenberg, in 
press-c). Nevertheless, the presence of high 
anxiety scores in these girls suggests that 
their masculine identifications do involve 
some conflict. Apparently, it is perceived as 
a conflict between themselves and their par- 
ents (higher scores on the home subscale) 
who are seen as severe and rejecting, rather 
than a conflict in their own self picture. Pos- 
sibly, their masculine assertiveness may be a 
form of defense against parental rejection. 


SUMMARY 


The present study sought to examine the 
relative effectiveness of like-sex and opposite- 
sex scales in discriminating sex role identifica- 
tion. Each subject received scores on like as 


well as opposite-sex scales, and three inde- 
pendent measures of emotional stability. Ap- 
parently, children scoring high or low on a 
like-sex scale do not differ significantly on 
measures of emotional stability. Children 
scoring high on opposite-sex scales tend to be 
more anxious, impulsive, and neurotic than 
children low on opposite-sex scales. Explana- 
tions of the differing symptoms of such sex 
deviant boys and girls are offered. From the 
results of the present study, there is some 
doubt that like-sex scales, as they are tradi- 
tionally used, are as effective as heretofore 
suspected in discriminating sex iden- 
tification. 


role 
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TEST ANXIETY AND PERFORMANCE 


UNDER STRESS‘ 


ZANWIL SPERBER 
Children’s Hospital, Philadelphia 


There has been much interest in the effects 
of anxiety on performance in recent years. In 
a series of studies Taylor, Spence, and others 
at Iowa showed that subjects (Ss) with a 
high anxiety level performed better on a 
variety of simple tasks than Ss low in anxiety 
(Spence & Farber, 1953; Spence & Taylor, 
1951; Taylor, 1951; Wenar, 1954). With 
more complex tasks, anxiety level tended to 
have the opposite effect, low anxiety Ss per- 
forming better (Farber & Spence, 1953; 
Maltzman, Fox, & Morrisett, 1953; Monta- 
gue, 1953; Ramond, 1953; Taylor & Recht- 
shaffen, 1959; Taylor & Spence, 1952). These 
results were explained in terms of Hull’s 
theory of learning (Taylor, 1951; Taylor & 
Rechtschaffen, 1959; Taylor & Spence, 1952). 
Child (1954) criticized this theoretical ap- 
proach for concentrating on the character- 
istics (simplicity or complexity) of the task 
while ignoring the category of responses Ss 
learn to make to the cues provided by their 
own anxiety. 

Sarason and his colleagues have presented 


1A report of part of a dissertation submitted to 
the University of Michigan in partial fulfillment of 
the requirements for the PhD degree. The writer 
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members of the doctoral committee for their aid and 
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appreciated. The HRRC detachment at Sampson Air 
Force Base, commanded by Charles J. Meder, partic- 
ipated actively during the entire process of data 
collection. The assistance given by the entire group, 
and especially by Raymond J. Stubblebine and Wil- 
liam McMackin, is gratefully acknowledged. 

This research, however, was not sponsored by the 
United States Air Force, and the Air Force is not 
responsible for this report. 


a more complex theory for explaining the role 
of anxiety in performance. Their theory pre- 
dicts different effects from small and large 
amounts of anxiety, and takes into account 
the S’s response to his own anxiety (Mandler 
& Sarason, 1952; Sarason, Mandler, & Craig- 
hill, 1952; Wiener, 1959). They assume that 
two kinds of anxiety responses may be aroused 
by a testing situation: those which are self- 
centered and ego-defensive, and those which 
are evoked and learned in the course of task 
performance and are directed toward task 
completion. 

Small amounts of anxiety are held to im- 
prove performance by increasing the S’s mo- 
tivation and strengthening his task relevant 
responses. Larger amounts of anxiety lead to 
or strengthen the ego-defensive (Ra) re- 
sponses. These responses are characteristic of 
the S rather than specific to the task, always 
available in the response repertoire and read- 
ily evoked. Since Ra responses are self-cen- 
tered rather than task relevant they interfere 
with performance. Individuals with a high 
anxiety drive will have many R, responses in 
their response repertoire. In contrast, in rela- 
tion to the total number of responses avail- 
able, low anxiety Ss will tend to make more 
task relevant anxiety responses. Results ob- 
tained with Yale students as Ss and using a 
variety of tasks (e.g., Koh’s blocks, maze 
learning, digit symbol, motor learning, projec- 
tive material) generally support the theory 
(Mandler & Sarason, 1952; Mandler & Sara- 
son, 1953; Sarason & Mandler, 1952; 
son et al., 1952; Wiener, 1959). 

Sarason and Gordon (1953) also developed 
a measure of anxiety specific to the test stress 
situation to be studied, implicitly recognizing 
that anxiety need not function as a unitary 
drive, but might be a potential reaction of the 
individual which is only elicited under certain 
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conditions. Sperber (1959) has reported cor- 
relations between the scores obtained by Air 
Force recruits on Sarason’s Test Anxiety 
Questionnaire and their scores on the Taylor 
Manifest Anxiety scale (1953) and Winne’s 
Neuroticism scale (1951). Winne’s interpre- 
tation of his results indicates that his scale 
can be considered a general measure of mani- 
fest anxiety (pp. 120-121). The highest cor- 
relation between the test anxiety and general 
anxiety measures was 


+.35, suggesting that 
a general anxiety measure might not be an 
adequate gauge of an S’s proneness to anxiety 
in a test situation. Since the interest of the 
present research was in studying how anxiety, 
when evoked, operates to affect test perform- 
ance, Sarason’s measure of anxiety proneness 


specific to testing situations was used. 

The purpose of the present study was to 
determine whether there is any difference in 
performance between: (a) high test anxiety 
Ss under high vs. low stress, (6) low test 
anxiety Ss under high vs. low stress, (c) high 
test anxiety Ss vs. low test anxiety Ss under 
high stress, and (d) high test anxiety Ss vs. 
low test anxiety Ss under low stress. 

A further purpose was to bring additional 
evidence to bear on the different theoretical 
approaches to the problem of anxiety and 
performance presented by Taylor and Spence 
(1952) and by Sarason et al. (1952). 


METHOD 


Stress Environment 


virtue of its ability to 
induce anxiety in individuals subjected to the situa- 
tion. Lazarus, Deese, and Osler (1952) observed that 
“The principal problem in the study of behavior 
under stress has been the production of realistic stress 
situations” 


4 situation is stressful by 


p. 295). This experiment took place in 
association with a naturally stressful testing situa- 
tion. The experimental data were collected at Samp- 
son Air Force Base in December 1953, using new 
recruits as Ss. Prior to testing, an officer lectured 
them on the importance of the aptitude testing pro- 
gram they were to undergo. Many of the recruits 
hoped to benefit from the opportunity for some 
specialized vocational training which would equip 
them for a post service career; all were interested in 
getting a good assignment for their service period. 
The officer told them, in essence, that the realization 
of their hopes would be determined by the results of 
the aptitude testing 

The recruits were then successive 
days. The first day was devoted to the regular apti- 


tested on two 


tude testing program. The second day was scheduled 
for HRRC experimental testing, but the recruits 
were only told to report to the same building for 
more testing. The use of identical administration 
procedures by the HRRC and aptitude testing staffs 
reinforced the Ss’ tendency to view both days’ test- 
ing as serving the same purpose. 

High stress. Ss were allowed to continue believing 
that the experimental tests, presented as the Abilities 
Test Battery and administered by uniformed military 
personnel, were part of the official assessment pro- 
gram. The battery consisted of a large number of 
timed subtests. The instructions implied that the Ss 
were expected to finish all of the items in each sub- 
test, incompleted items counting as failures. The time 
limits were so chosen that the Ss could not possibly 
complete all of the items. The very first “test” was 
an interesting puzzle chosen to challenge and involve 
the S, but not solvable in the time allotted (Cowen 
1952, p. 514). It was included solely for the purpose 
of starting the Ss with a failure experience 

After completing the battery, a 40-minute inter- 
lude was devoted to procedures presented as not 
related to the “assessment program,” and adminis- 
tered by a civilian. Then the sergeants announced: 
“We're going to repeat two of the tests you took this 
morning. You've had them and you 
should improve. Let’s see how much better you do 
this time.” The first test was the puzzle 
provided a failure experience. Ss then 
Letter Substitution Test 

Low stre The instructions were 
make it clear that the Abilities Test Battery was 
experimental, and not part of the regular assessr 
program, but also to indicate the importance of the 
tests to the Air Force and to maintain a high level of 
task oriented motivation for good performance. For 
these Ss the first test was a simple line tracing task 
which started them with a success experience. The 
rest of the battery was the same as for the high 
stress group. The low stress Ss were interrupted the 
same number of times and in the same places, but 
the instructions allowed them to treat unanswered 
items as evidence that the test required changes, 
rather than as signs of personal inadequacy 

After the 40-minute interlude, instead of receiving 
instructions which implied that an improvement in 
performance was expected, the low stress Ss were 
told: “We're going to 
took this morning 
which should help, 
now—so let’s see 


practice on 


, which again 
repeated the 


designed to 


repeat two of the tests you 
You've had practice on them 
but you’re somewhat tired by 
what happens. Do the best you 
can.” 

To help underscore the dissociation of the tests 
from the regular program, the test ad- 
ministrator wore civilian clothes. To emphasize that 
the Air Force really had an interest in the procedure, 
the proctor, who was in a less active role, remained 
in uniform 


assessment 


2Stubblebine administered the performance tests 
under both high and low stress conditions, and Swen 
son proctored all of the testing 
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Performance Battery * 


A series of paper and pencil tests, suitable for 
group administration, and tapping a range of cogni- 
tive functions was used. With the exception of the 
atmosphere reinforcing puzzle or tracing task, high 
and low stress Ss worked on the same tasks, which 
are described in the order they appeared in the 
battery. 

Letter series. An expanded version of Thurstone’s 
(1943) PMA Reasoning subtest, containing the orig- 
inal 30 items and 20 new ones constructed by the 
writer, was administered in two consecutive parts. 
For each part, containing a random selection of 15 
original and 10 new items, 4 minutes were allowed, 
the parts being scored separately. 

Letter substitution. The items were taken from 
Lazarus and Eriksen’s (1952) expanded version of 
the Form I Digit Symbol test. The Ss were allowed 
34 minutes for 200 items. This test was repeated in 
the post interlude battery. 

Revised Minnesota Paper Form Board. The series 
MB version of this test (Likert & Quasha, 1948), 
which measures the S’s ability to perceive spatial 
relations, was administered in the usual manner. 

Number matching. This test is similar to the num- 
ber comparison sections of the Minnesota Vocational 
Test for Clerical Workers (Andrew, Patterson, & 
Longstaff, 1933), but new items were constructed by 
the writer. It measures the S’s ability to concentrate 
and attend to detail, the items requiring the S to 
make rapid discriminations of small differences. Each 
page contained 100 items and was timed and scored 
separately. There were three consecutive parts to the 
test, with 44 minutes allowed per page 


Test Reaction Questionnaire (TRQ) 


After the performance testing, the experimental 
purpose of the tests was explained to the Ss. Then a 
22-item questionnaire, constructed by the writer and 
similar in format to the Test Anxiety Questionnaire 
(Sarason & Mandler, 1952), was administered to 
ascertain the S’s reactions to various aspects of the 
testing situation.* The S put a mark on a 16-centi- 
meter line to indicate the strength of his reaction, the 
ends of the line representing polar responses to the 
question. Each question was scored by measuring the 
distance in centimeters from the end of the line to 
the S’s mark. 


Personality measures 


A slightly modified version of Sarason’s Test Anx- 
iety Questionnaire (TAQ) * was administered to the 


31 wish to thank Robert H. Bauernfeind, Editor 
of the Test Department, Science Research Associates, 
for permission to use the Letter Series items from the 
PMA (Thurstone, 1943), and Richard S. Lazarus for 
making available the extended version of the Digit 
Symbol test. 

*Copies of the TRQ may be obtained from the 
writer. 

5I wish to thank Seymour B. Sarason for making 
available a copy of his scale, and granting permission 
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Ss after the TRQ. Following Sarason and Gordon's 
(1953) suggestion, local norms, based on the re- 
sponses of both high and low stress Ss, were used in 
scoring each question. The resulting distribution of 
scores was very similar to that reported for Yale 
students (Sperber, 1959). 


Other Procedures 


The Minnesota Multiphasic Personality Inventory 
(MMPI) was administered in an afternoon session. 
Also available for each S was his stanine score on the 
Technician’s Specialty Index (TSI) (Dailey, Lecznar, 
& Brokaw, 1948), considered to be the military test 
which best measures general intelligence level. For 
most Ss, raw scores on the Armed Forces Qualifica- 
tion Test (AFQT) (Bolanovich, Mundy, Burke, & 
Falk, 1953), another gauge of intelligence level, were 
also available 


Subjects 


All 399 new recruits who arrived at Sampson AFB 
to begin basic training the week before this experi- 
ment was conducted constituted the original sample 
tested. The Ss were organized in six “flights” with 
about 65 men in each. Three flights were arbitrarily 
assigned to each of the stress conditions. Tests were 
administered to one or two flights at a time, 201 men 
performing under high and 198 under low stress 

Recruits with a TSI in the lowest two stanines, or 
an AFQT score less than 15 were considered too low 
in intelligence to participate, and their data were not 
analyzed. Twenty percent were eliminated by these 
criteria. Ss whose MMPI records did not meet ac- 
ceptable standards on the validity checks (Hathaway 
& Meehl, 1951) were also dropped, leaving 146 high 
stress and 148 low stress Ss. These men supplied the 
data on which were based the norms for scoring the 
TAQ, the correlations between test anxiety and gen- 
eral anxiety, and the analyses of reactions to the 
experimental conditions (TRQ responses). 

The performance of only those Ss who scored in 
the highest and lowest quartiles on the TAQ was 
studied. High test anxiety (HTA) Ss had scores of 
24 or higher, low test anxiety (LTA) Ss scored 11 
or less. There were 61 Ss under high stress, 32 HTA 
and 29 LTA. Under low stress there were 71 Ss, 33 
HTA and 38 LTA. These four groups were well 
matched with respect to age, years of education, and 
intelligence. For each group the mean age was close 
to 19, mean years of education was about 11, and 
the mean TSI stanine was approximately 5.5. 


RESULTS 
Effectiveness of the Experimental Operations 


The distribution of responses of the 146 
high stress and 148 low stress Ss to each of 
the questions of the TRQ were compared by 


for such changes in wording as were necessary to 
adapt it for use in a different context. 
® Personal communication from Abraham Carp 








Test Anxiety and Performance under Stress 


means of the Marshall test (Smith, 1953). 
The high stress Ss differed significantly from 
the low stress Ss in being convinced that the 
testing would decide their Air Force assign- 
ments. The low stress Ss more strongly ac- 
cepted the idea that testing was for experi- 
mental purposes. They differed | significantly 
(p< .05) on 11 of the 22 questions. High 
stress Ss felt more anxious, a greater pressure 
to do well, and a greater degree of unpleas- 
antness in the testing situation. The two 
groups did not differ in their evaluation of 
how much their emotions and anxieties had 
interfered with performance. 

The differences in TRQ responses of Ss 
scoring above and below the median on the 
test anxiety measure were tested separately 
for Ss who had performed under high stress 
and Ss who had performed under low stress. 
Under high stress, HTA and LTA Ss re- 
sponded differently to 10 of the 22 questions 
(Marshall test, p< .05). HTA Ss more 
strongly believed that their Air Force assign- 
ment depended on their performance, felt 
more anxious, thought their emotions influ- 
enced their performance adversely, and con- 
sidered the testing unpleasant. The groups 
did not differ with respect to test motivation. 

Under low stress, HTA and LTA Ss re- 
sponded differently to 4 out of 22 questions. 
There was a stronger tendency for HTA Ss 
to believe that their Air Force assignment 
might be at stake, and they responded with 
more anxiety to various aspects of the situa- 
tion. There were no differences in motivation, 
feelings about the pleasantness of the situa- 


TABLE 1 


PERFORMANCE DIFFERENCES OF MATCHED Hicu Test 


ANXIETY SUBJECTS UNDER HicH vs. Low STRESS 


Letter Series 1 

Letter Series 2 

Letter Substitution A 
Paper Form Board 
Number Matching 1 
Number Matching 2 
Number Matching 3 
Letter Substitution B 
Intelligence (TS] 


adf= 40. 
High Stress Ss were more variable (/ 
High Stress Ss were more variable (/ 
** 05 


2.53, p <.05) 


=2.43, p <.05). 


TABLE 2 


PERFORMANCE DIFFERENCES OF LOW 
SUBJECTS UNDER HIGH vs. 


Test ANXIETY 
Low STRESS 


High Stress 


Low Stress 


Test Mean g Mean o 


Letter Series 1 4.9 2 6.3 
Letter Series 2 6.2 7.9 3.0 
Letter Substitution A 98.0 108.9 29.2 
Paper Form Board 39.6 5 39.8 6.3 
Number Matching 1 55.6 58.4 13.6 
imber Matching 2 55.9 57.0 12.8 
imber Matching 3 55.9 62.0 16.4 
137.6 33.2 
5.7 1.6 


Ni 
Ni 
Letter Substitution B 133.5 


Intelligence (TSI) 5.2 


* df = 65 


> Low Stress Ss were more variable (F =2.38, p <.02 


the value of ¢ necessary for significance were calculate 
the method described by Edwards (1950, p. 167-169) 
¢ High Stress Ss were more variable (F =1.83, » <.05 
« Low Stress Ss were more variable (F =2.03, p <.05) 


*p< .05. 
tion, or evaluation of the influence their 
tions had on performance. 


Performance 


The 32 HTA Ss tested under high stress 
performed significantly better than the 33 
HTA Ss working under low stress on the three 
Number Matching tests. Critical ratios of the 
mean differences yielded p values of < .01, 
< .001, < .05. These results, however, were 
based on groups differing not only with re- 
spect to stress treatments, but also in test 
anxiety level. The high stress HTA Ss were 
significantly more anxious than the low stress 
HTA Ss (Marshal test, = .05). To control 
for this the HTA groups were matched for 
level of test anxiety by a random selection 
procedure (Sperber, 1956, p. 136), 21 HTA 
Ss remaining in each stress group. 

The results of the ¢ tests of the differences 
in mean performance of the matched HTA 
groups working under high vs. low stress is 
given in Table 1. The groups also were ade- 
quately matched for intelligence. Stress clearly 
influenced the performance of HTA Ss, the 
groups tested under high stress performing 
significantly better on two tests, Number 
Matching 1 and 2. 

The results of the ¢ tests of the differences 
in mean performance of the LTA Ss tested 
under high vs. low stress are presented in 
Table 2. The groups were adequately matched 
both for intelligence and level of test anxiety. 
The LTA Ss consistently had higher mean 
scores under low stress. For both reasoning 
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tests the mean differences reached statistical 
significance. 

The ¢ tests of differences in mean perform- 
ance of HTA and LTA Ss, both tested under 
high stress are presented in Table 3. On six 
of the eight measures the HTA performed 
better. The only statistically significant dif- 
ference was on Number Matching 2, where 
the HTA group was superior in performance. 
The results on the other two Number Match- 
ing tests supported this finding, both the 
mean differences favoring the HTA Ss, and 
approaching significance. For Number Match- 
ing 1 the value of p was .08, and for Number 
Matching 3 it was .10. 

The critical ratios, testing the significance 
of the differences in mean level of perform- 
ance under low stress of HTA and LTA Ss, 
are reported in Table 4. On all eight tests the 
LTA Ss were superior in performance. Two of 
the mean differences were statistically signifi- 
cant. For three more tests, namely, Number 
Matching 1 and 2, and Letter Substitution B, 
the differences were nearly significant, the 
values of p reaching .06, .09, and .07, re- 
spectively. 


DISCUSSION 


Our findings do not appear to be compat- 
ible either with the Iowa or with the Sarason 
theory. With respect to the Iowa position, we 
find that under high stress, HTA Ss perform 
better on Number Matching than do LTA Ss. 
Here higher drive seems to facilitiate per- 
formance, the correct responses for this task 
presumably being uppermost in the average 


TABLE 3 


PERFORMANCE DIFFERENCES OF HiGn Test ANXIETY 
AND Low Test ANXIETY SUBJECTS UNDER HIGH 
STRESS 


HTA 


Test Mean 


Letter Series 1 4.6 
Letter Series 2 7.2 
Letter Substitution A 106.0 
Paper Form Board 37.5 
Number Matching 1 61.0 
Number Matching 2 62.1 
Number Matching 3 61.3 
Letter Substitution B 134.8 
Intelligence (TSI) 5.4 


NMaOoeoOuUcnn 


= df = 59. 
> HTS Ss were more variable (F 
*p< 05 


2.19, p <.05) 


Sperber 
TABLE 4 


CRITICAL 
HicH 


RATIO OF PERFORMANCE DIFFERENCES OF 
Test ANXIETY AND Low Test ANXIE 
SUBJECTS UNDER Low STRESS 


LTA 


Mear 


Letter Series 1 5.8 ) 6.3 
Letter Series 2 7 7.9 3.0 
Letter Substitut 108.9 29.2 
Paper Form Board 39.8 6.3 
Number Matching 1 58.4 13.6 
Number Matching 2 57.0 12.8 
Number Matching 3 62.0 16.4 
Letter Substitution B 137.6 33.2 
Intelligence (TSI 5.7 1.6 


4 


* LTA Ss were more variable (/ 73, p <.01); CR nece 
for signific: > at < 5 wit t observed eteroge 
Variances 1s 2 | 16 169 

» LTA Ss were more variable 3.28 001). 


*np< OS 


S’s response hierarchy. On the same tests 
under low stress, however, LTA Ss perform 
better than HTA Ss, higher drive level ap- 
parently interfering with performance. Since 
the same test is involved in both cases, we can 
not readily attribute the reversal of the effects 
of a heightened anxiety drive to the different 
types response hierarchies elicited 
simple as contrasted with complex tasks. 


of by 


Taylor (1956) has commented on the many 


characteristics other than drive level in which 
anxious and nonanxious Ss may differ, and 
which may influence performance (p. 303). 
The interest of the Iowa group in their study 
of anxiety has been restricted to the effects of 
drive, although the influence of anxiety drive 
per se on performance has been acknowledged 
to be small (Spence & Taylor, 1953; Taylor, 
1956). Our own results confirm Taylor’s sug- 
gestion that to understand the performance of 
anxious and nonanxious Ss, a broader ap- 
proach to their characteristics is necessary. 
Such an approach will be indicated below. 
The effects of anxiety on the performance 
of our Air Force recruits were consistently 
opposite to the results reported by Sarason. 
For the Yale Ss, strong anxiety—as mani- 
fested by Ss who were both high in test 
anxiety and tested under high stress—inter- 
fered with performance, but for the recruits 
studied in the present research strong anxiety 
improved performance. For the Yale students, 
moderate anxiety—as manifested by HTA Ss 
tested under low stress, or LTA Ss studied 
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under high stress—facilitated performance, 
but for our recruits it was associated with 
poorer performance. For the Yale students, a 
near lack of anxiety—such as would be the 
case for LTA Ss studied under low stress— 
was paralleled by a lowered level of perform- 
ance. In contrast, for our Air Force recruits 
this lack of anxiety was associated with an 
improvement in performance. 

A rationale for the performance of our re- 
cruits, and the difference between their per- 
formance pattern and that of Yale students 
can be offered in terms of the S’s assessment 
of the personal importance of the testing sit- 
uation, and the tendency to use avoidance or 
vigilance as a defense against anxiety, de- 
pending on the importance of a situation. 

Douvan (1956) has demonstrated that an 
S’s evaluation of the importance of.a testing 
situation will depend on experiences associ- 
ated with his social status. She showed that 
working class adolescents become involved 
and strive to do well in a situation where good 
performance can earn them an immediate 
reward, but are not motivated to do well when 
no direct reward is at stake. In contrast, the 
middle class adolescent appears to be strongly 
motivated for successful performance in both 
situations. Our Ss were predominantly from 
the working class (Sperber, 1959). McNeil 
(1953) has shown that a low evaluation of 
formal education is characteristic of families 
from a working class background. Although 
the recruits were about the same age as Yale 
students, a majority of them never completed 
high school. Their fewer years of education is 
assumed not to be simply a function of lower 
intelligence level, but also to reflect skepti- 
cism about the value of an academic educa- 
tion. This attitude was apparently shared by 
their parents since many of them were young 
enough to require parental permission to en- 
list. The Yale Ss were predominantly from 
the middle class (Sarason & Mandler, 1952) 
where characteristically there are strong pres- 
sures for academic achievement (McNeil, 
1953). 

The operation of vigilance and avoidance 
as defenses against anxiety is conceived in 
terms akin to Bruner and Postman’s (1947) 
description of perceptual vigilance and de- 
fense : 


In any given situation the organism singles out 
what it considers to be the environment’s most rele- 
vant aspects—relevant to adaptation in the situation 
So long as the situation is not too threatening or too 
exacting, avoidance of meaning may be emotionally 
the most economical response. But in situations which 
are highly threatening and highly exacting, the most 
adaptive perceptual response 
which takes most 


76) 


is frequently the one 
vigilant account of “reality” (p 


With reference to the present findings, we 
view the high stress situation, testing which 
the S thought would determine his future, as 
one in which the S would be motivated to 
participate to the best of his ability. Partici- 
pative rather than avoidance behavior seemed 
most in the S’s self-interest, and we assume 
both HTA and LTA Ss were involved in the 
performance situation. We attribute the supe- 
rior performance of the HTA Ss to the func- 
tion of anxiety as an internal stimulus which 
could constantly remind them of both the 
importance and danger of the situation, and 
reinforce their motivation to be alert and per- 
form well. The performance of the LTA 
group did not benefit from this increase in 
vigilance. This rationale seems especially 
plausible when we consider the nature of the 
Number Matching tests, on which the signifi- 
cant performance differences occurred. The 
greater anxiety of the HTA Ss would main- 
tain their motivation to perform well on a 
repetitive, boring task, given at the end of a 
long session. An increment in perceptual vig- 
ilance would improve their discrimination of 
srr ill differences. 

Under low stress the LTA group performed 
better. The TRQ data allow us to assume that 
for the HTA Ss the situation was sufficiently 
like one involving real tests, to elicit feelings 
of anxiety. Since the instructions used assured 
the Ss that it was safe to be unconcerned 
about one’s own performance, i.e., no imme- 
diate gain for them was possible, they would 
tend to defend themselves by withdrawal 
when the task became noxious. This need to 
avoid became strong enough to affect perform- 
ance only on the more repetitious tasks used. 

Our analysis of the results of the present 
study has been in terms which are similar in 
many respects to the theoretical formulations 
advanced by Lazarus and his colleagues (e.g., 
Vogel, Baker, & Lazarus, 1958) and by Rue- 
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bush (1960). The present study and other 
recent researches (Ruebush, 1960; Vogel et 
al., 1958; Wiener, 1959) have all presented 
results which point up the complex interac- 
tion of anxiety, motive, defense, and task 
variables. A prediction of whether anxiety 
leads to an improvement or decrement in the 
performance of a given S appears to depend 
on the answers to four related questions: (a) 
Does the S accept the performance situation 
as something so important that he must par- 
ticipate in it, or as something that can appro- 
priately be avoided? (b) Does the anxiety 
aroused in the S by the situation lead to mo- 
tivations which are more important as deter- 
minants of his behavior than the actual 
amount of anxiety elicited? (c) With respect 
to the direct effect of anxiety on performance, 
what kind(s) of defensive behavior is (are) 
engendered in the S by varying amounts of 
anxiety? (d) What relationship obtains be- 
tween the nature of the S’s defenses and the 
structure and demands of the task? 


SUMMARY 


Air Force recruits, scoring in the highest 
and lowest quartiles on Sarason’s measure of 
test anxiety were tested under high and low 
stress. The significant performance differences 
were: 


1. Under high stress, HTA Ss performed 
better than LTA Ss on the Number Matching 
tests. 

2. Under low stress, LTA Ss performed 
better than HTA Ss on the Letter Substitu- 
tion and Number Matching tests. 

3. Performance of HTA Ss under high 
stress was superior to that of a group matched 
on test anxiety but subject to low stress on 
two of the three Number Matching tests. 

4. Performance of LTA Ss under low stress 
was superior to that of a group matched on 
test anxiety but subject to high stress on the 
two Letter Series tests. 


The results were discussed in relation to 
the findings of Taylor and Spence, and those 
of Sarason et al., and a theoretical formula- 
tion advanced which emphasized the complex 
interaction of anxiety, motive, defense, and 
task variables. 


Zanwil Sperber 
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WECHSLER’S DETERIORATION RATIO IN CLINICAL 
PRACTICE 


T. G. CROOKES 
St. John’s Hospital, Aylesbury, England 


Studies on Wechsler’s Deterioration Ratio 
have mostly suggested that it is of very lim- 
ited value in diagnosing brain damage. None 
of those, for instance, quoted by Yates (1954) 
showed any very convincing differentiation of 
brain damaged from other groups. Even where 
it distinguished brain damaged from “nor- 
mals,” it did not show any appreciable differ- 
ence between brain damaged and other psy- 
chiatric groups (Hall, 1952; Rogers, 1950), 
and it is this latter distinction which it is of 
practical value to make. The usual method is 
to take groups of known diagnoses, apply the 
tests to them, and compare the results. For 
reasons given below, this was not thought to 
be the most satisfactory way in this case, and 
it seemed to be of interest to examine the 
ratios in a group referred for routine purposes 
over a period of time, to look into the distri- 
bution of the values and compare them with 
the final diagnoses. 

The actual ratio used is not exactly the 
same as Wechsler’s. The same four “don’t 
hold” tests are used, but only two “hold” 
tests: Vocabulary and Picture Completion. 
The score on these is doubled, and the per- 
centage loss is calculated in the usual way 
with Wechsler’s age allowances. Object As- 
sembly was omitted because it does not seem 
to be a good hold test even on Wechsler’s data 
(1944, p. 150), and this is a rather embar- 
rassing test for adults, except the duller ones. 
Information was omitted to keep the balance 
of verbal and performance tests. The ratio 
will be referred to as DR. 


PROCEDURE 


Wechsler’s test is applied routinely to almost all 
patients referred for psychological examination at 
the above hospital. It is a regional National Health 
Service hospital, catering for all kinds of mental 
illness. 


The DR was calculated for all male inpatients re- 
ferred over a period of 6 years, whatever the reason 
for referral, providing they had done all the tests 
necessary for the ratio. In all cases it was the 
Wechsler-Bellevue Scale, Form I, and where the test 
had been repeated, the figures of the first testing were 
used. The diagnoses were the final diagnoses made on 
discharge or death, and, where the patient is still 
here, the latest diagnosis at the time of writing sup- 
plied by the psychiatrist in charge of the case. 

There are altogether 261 men, of whom 171 were 
less than 50 years of age at the time of testing, and 
90 were 50 or over. The main comparison is between 
those in whom the diagnosis implies physical inter- 
ference with the brain (the “organic” group) and the 
rest. The under-50 organic group consists of 23 pa- 
tients, 12 epileptics, 
tions (2 of them alcoholic), 2 cerebral syphilis, 1 
confusional state in disseminated sclerosis, and 2 
cases of unspecified brain disease. The over-50 or- 
ganic group contains 22 patients, 12 with senile or 
presenile dementia, 1 alcoholic dementia, 5 toxic con- 
ditions, 2 cerebral syphilis, 1 head 
cerebrovascular disease 


3 head injuries, 3 toxic condi- 


injury, and 1 


The remainder were divided into five broad diag 
nostic categories, to see if any type of score was 
peculiar to any group, and to see if the results agreed 
with the finding of Rogers (1950) that all malad- 
justed groups showed higher DRs than normals, but 
did not differ among themselves. The under-50 “non- 
organic” group consists of 30 depressives (including 
manics), 45 schizophrenics, 29 neurotics, 33 psycho- 
paths, and 11 paranoid states (including paraphren- 
ics). The over-50 group contains 40 
depressives, 1 schizophrenic, 10 neurotics, 9 psycho- 
paths, and 8 paranoid states 


nonorganic 


RESULTS 


Table 1 shows the means and ranges of 
DRs for the organic and nonorganic groups. 

In both age groups the organics have con- 
siderably higher values than the others. 
(Comparing organic and nonorganic means, 
for the under-50 groups, t = 4.866, p < .001; 
for the over-50, ¢ = 4.596, p < .001.) But for 
practical purposes, one needs to know to what 
extent they can be differentiated by taking 
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TABLE 1 
MEAN DETERIORATION RATIO, ORGANIC AND NONORGANIC 
Under-50 Over-50 
Group N Vf Range V 7 Range 
Organic 23 +28.9 +59 to —22 22 +23.0 +57 to —18 
Nonorganic 148 + &8 +57 to —42 68 - 0.4 +-42 to —52 


some critical value or “‘cutting score.” Table 
2 gives the number falling above +20 
(Wechsler’s suggested critical score) and 
above +30, by age group, and whether or- 


ganic or not. 

If the two age groups are combined, we 
find using a score of 20 that 76.6% are cor- 
rectly classified (organics 64.4% and nonor- 
ganics 79.2%). Using the score of 30, 85.1% 
are correctly classified (organics 53.3% and 
nonorganics 91.7%). The increase in total 
percentage correct obtained by raising the 
critical score from 20 to 30 is mainly due to 
the fact that the nonorganic group is much 
larger than the organic; by raising the score 
the discrimination of the nonorganic group is 
improved, that of the organic group is made 
worse, and as the former is so much bigger 
they contribute more to the total percentage. 
However, it will be noted that with the score 
of 30, the mean of the organic and nonorganic 
percentages (72.5) is slightly greater than 
their mean with the score of 20 (71.8). This 
mean can be considered as the percentage 
correct if the two groups are made equal in 
number; so the cutting score of 30 appears 
slightly better. In any case, it is probably of 
more value in this kind of problem to have a 


more extreme score, which nearly eliminates 
TABLE 2 
PROPORTIONS DISTINGUISHED BY CUTTING 
Scores oF +20 Ann +30 
I SO ) 0) 
Grot DR 1 DR 0) DR 1 RK 0 
ga 16 ) 15 


DR <3! DR>3J0 DR<31 DR>W 


Org 10 13 11 11 
Nonorgar 134 14 64 4 
86 83 








one group, providing it leaves a substantial 
number of the other. For instance, a score 
which gave 100% correct and 
50% organics, would be of more practical 
value than one which gave 75% of each, al- 
though the total percentage correct is the 
same. In the first case you would be able to 
feel virtual certainty when some scores came 
up, but never in the second case. The whole 
general question of how proportions obtained 
in this way can be used in a practical situa- 
tion, where the size of the populations from 
which you are drawing your samples is un- 
known, is considered in the discussion. 

Table 3 shows the mean values of the DRs 
for the nonorganic patients divided into five 
diagnostic categories, again separated into the 
two age groups. 

The most striking thing about Table 3 is 
that the younger patients consistently give 
higher values than the older ones, while 
within each group, there is little difference 
between diagnoses. This is fairly clear by 
inspection, but a simple analysis of variance 
was carried out for each age group (leaving 
out the schizophrenic in the over-50). In each 
case, the between diagnoses mean square was 
smaller than that within diagnoses. The schiz- 
ophrenics and paranoid states, under-50, were 
combined (one can find reasons why they 
might be combined) and compared with the 
rest of their age group. This gives ¢ = 1.31, 
p > .1. In the older age group, the neurotics 
were compared with the rest; ¢ = 1.63, p 

> 1. It will be seen also that the high values 
of the DR are fairly evenly distributed among 
the various groups. 

From the way in which the DR is calcu- 
lated, the expected mean value for any age 
group is nought, if the group is comparable 
with Wechsler’s standardization group. For 
the over-50 group, the mean is very close to 
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TABLE 3 


DETERIORATION RATIO OF NONORGANIC PATIENTS By DIAGNOSIS 


Under-50 


Group Range 
Depressives 
Schizophrenics 
Paranoid states 
Neurotics 
Psychopaths 


+47 to —42 
+57 to —34 
+31 to —14 
+46 to —26 
+49 to —29 


Total +57 to —42 


nought, but not for the under-50 group; com- 
paring their mean (+8.81) with its standard 
error (1.50) gives ¢ = 5.87, p < .001. 


DISCUSSION 


In differentiating organics from nonorgan- 
ics, the results here seem rather better than 
those obtained in most studies. One reason 
for this is probably that these are not selected 
groups of patients, but patients referred for 
information in the routine way. This means 
that in most cases the diagnosis was in doubt 
at the time of referral, and the deterioration 
was not marked. Gutman (1950), in summing 
up her study, says that hers were definite 
cases and that in less advanced cases even 
poorer differentiation would be expected. In 
general, the principle of taking extreme cases 
is excellent, but in this case it is not appro- 
priate. The theory behind the ratio is that 
certain types of activity deteriorate more 
rapidly than others with aging and other 
forms of damage to the brain. It is not sug- 
gested that the other activities do not dete- 
riorate at all. Clearly with gross dementia all 
abilities, including Vocabulary, disappear, cf. 
Yates (1956). It might be expected that the 
discrepancy would be most clear in the early 
stages, while later on all abilities declined to 
a common level. 

Hall’s (1952) organic cases are described 
as mostly moderately deteriorated, and he 
found only 5 out of 24 with ratios over +29 
and only 7 over +19. His inclusion of Object 
Assembly may partly account for this. The 
group described by McFie and Piercy (1952), 
consisting of patients with localized cerebral 


Over-50 


n> 30 ] VU Range 


+08  +42to 

+14 
+01 +26 to —36 
-96 +32 to —29 
+-28 to —16 


-52 


+-2.6 


—().4 


+42 to —52 


lesions, gave results fairly close to those of 
the present study. Of 56 DRs calculated, they 
had 24 (43%) over +29 and 37 (66%) over 
+19. For my whole organic group, the equiv- 
alent percentages are 53 and 67. 

An interesting problem is the question of 
what weight can be given to an actual DR in 
a practical situation. If one has obtained a 
ratio greater than 30, what can be said about 
it? In this group, 18 out of 216 nonorganics 
obtained such a score, or 1 in 12; 24 out of 45 
organics did so, rather more than half. So, in 
a sense, it can be said that such a score has a 


good probability of being organic; an organic 
case is much more likely to produce one than 
a nonorganic. However, if one considers the 
distribution of the 42 such scores obtained, 
only 24 are organic, and the probability of a 
given score of over 30 being organic is only 


4/7, not much more than a half. In general, 
this is because the probability of an event’s 
belonging to one of two categories depends 
not only on the relative frequency of the 
event within each category, but also on the 
relative frequency of the two categories in the 
population from which the event is drawn. 
This seems to be a general difficulty in all 
tests of this kind except where a certain score 
is found in virtually 100% of one category. 
The proportions found in one situation cannot 
be used for giving a probability in another 
situation where the constitution of the popu- 
lation is not known, as in most clinical sit- 
uations. 

The information has to be combined with 
data from other tests, and with qualitative 
indications on the Wechsler itself, for ex- 
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ample, differences on the Similarities (Hall, 
1952) and “rotation” on the Block Designs 
(Shapiro, 1951), which are known to be re- 
lated to the distinction being made. Another 
approach might be to attempt to eliminate 
individuals who come at the extreme of the 
distribution of DRs without having deterio- 
rated. According to Wechsler (1944, p. 66), a 
DR of +20 is 2 Probable Errors from the 
mean, so that in a normal group one would 
expect to find about 9% above +20, and 
about 2% above +30. Subjects with the 
classical type of congenital reading disability, 
in which ordering and spatial organization are 
involved, might be expected to make poor 
scores on don’t hold tests, especially Digit 
Span, Arithmetic, and Block Designs. There 
are three subjects in the present series with 
reading difficulties marked enough for it to be 
mentioned as one of their problems. They are 
all in the under-50 nonorganic group, and all 
have DRs greater than +29. It may be noted 
that the present group agrees with Wechsler’s 
in the distribution at the other extreme of the 
scale. There are 8 minus ratios greater than 
30 (3%) and 21 greater than 20 (82). 
Another thing associated with high DRs is 
low scoring on the test. Hold scores of 4 
and don’t hold scores of 2 do not suggest 
much absolute difference in the abilities con- 
cerned, but they would give a DR of +50. In 
the nonorganic under-50 group, there are 19 
patients with IQ less than 80; 6 of these have 
DRs over +30, as opposed to 8 out of the 
other 129 with IQ over 79. This is not of 
much practical help. because low scores also 
occur in organic cases. In the organic under- 
50 group, there are 5 out of 23 with IQ below 
80, and all 5 have DRs over +30. However, 
it does increase one’s confidence in the signifi- 
cance of a high DR in a person of higher IQ. 
As far as the under-50 nonorganic group is 
concerned, the study confirms the finding of 
Rogers (1950) that psychiatric groups show 
a higher DR than normals but do not differ 
among themselves. The over-50 nonorganic 
group, however, does not show more deterio- 
ration than the normal expectation. This cu- 
rious age difference was also found, in psychi- 
atric patients, by Garfield and Fey (1948). 
A similar age difference, in the same direction, 
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appears in the organic groups. One explana- 
tion could be that the don’t hold tests are 
affected by either mental illness or aging, but 
the two factors do not have an additive effect 
when they occur together, so that the older 
patient is not affected more than is allowed 
for in the age corrections; alternatively, the 
illness, when combined with greater age, 
might affect the hold as well as don’t hold 
tests. If patients were retested after recovery 
with the equivalent form of the test, on each 
hypothesis the younger ones would be the 
same on the hold tests, and better on the 
don’t hold; on the first hypothesis the older 
patients would be the same on both types of 
test, on the second, they would be better 
on both. 

The fact that mental illness has this tend- 
ency to raise the DR does not destroy the 
utility of the index in the diagnosis of brain 
damage, since the latter raises it significantly 
more. An index of over 30 gives a strong 
initial suggestion of brain damage, especially 
when the IQ is not low. Whatever method of 
assessing deterioration is used, an estimate of 
intelligence is needed in evaluating the re- 
sults, so that Wechsler’s Scale can be used to 
fulfil a double purpose. 


SUMMARY 


A modified form of Wechsler’s Deteriora- 
tion Ratio was calculated retrospectively for 
261 male mental hospital inpatients who had 
been tested routinely over a period of 6 years. 
The scores were compared with the eventual 
diagnoses, and it was found that the ratio 
distinguished organic and nonorganic cases 
fairly well. In the nonorganic group, there 
was no variation corresponding to diagnosis, 
but there was confirmation of previous find- 
ings that psychiatric patients give larger 
ratios than normals. This, however, applied 
only to the younger group (under-50). 
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Traditionally, vocabulary has been re- 
garded as one of the functions affected least 
and latest by impairment due either to 
schizophrenia or to brain damage. In a recent 
study, Smith and Kinder (1959; Smith, 
1958) report no significant loss in vocabulary 
for topectomized schizophrenics as compared 
with controls, 8 years after surgery. On a 
variety of other tests the brain operated 
subjects did show significant losses. Smith 
has suggested that significant losses tend to 
appear more frequently on measures involving 
sustained attention and perceptual organiza- 
tion than on measures involving vocabulary 
and verbal skills. 

Williams, Lubin, and Gieseking (1959) 
report seemingly opposite results for non- 
schizophrenic subjects with a variety of brain 
conditions: vocabulary and verbal skills not 
only: show clearly significant losses, but 
furthermore the amount of loss is as much, 
or even more, than on spatial tests. Subject 
samples, site and nature of tissue damage, 
lapse of time following the lesions, and the 
type of vocabulary test used raise many ques- 
tions. The present study focuses on the last 
of these three factors: what is the effect of 
an oral test format, as compared with a 
multiple-choice format, on vocabulary scores 
following brain surgery? 


1 Now at the Veterans Administration Hospital, 
Danville, Illinois. The senior author’s participation in 
this research was made possible by support from 
Western Psychiatric Institute, Henry W. Brosin, Di- 
rector; as well as by support from the Topectomy 
Follow-Up Project with a research grant (M-1191) 
from the National Institute of Mental Health and 
funds and assistance from the Research Foundation 
for Mental Health, Inc. of the New York State 
Department of Mental Hygiene. 


This factor of test format, and of the 
testing procedure, has received little attention 
in studies of deficit following brain damage. 
The Williams et al. study does comment on 
the test used. These authors base their con- 
clusions on the Reading and Vocabulary 
(RV) Test from the Army Classification 
Battery (ACB). They offer evidence for the 
essential equivalence of the RV test and the 
Wechsler oral Vocabulary test, citing correla- 
tions of .76 between these two tests for Army 
recruits, for enlisted men, and for brain 
injured patients. They also note that the 
“majority of the items in RV require the S 
to read and understand a paragraph. Only 
a minority are standard vocabulary items.” 

Although the present study limits con- 
sideration to test procedure, some comment 
on the general and often contradictory evi- 
dence for vocabulary deficit following brain 
damage should be offered. Two of the most 
relevant and adequate studies are those of 
Yates (1956) and of Weinstein and Teuber 
(1957). Yates, on the basis of an extensive 
review, concludes that “vocabulary does 
decline in patients suffering from brain- 
damage.” This general conclusion may be 
supplemented by Weinstein and Teuber’s 
careful consideration of differential effects 
when lesions involve specific brain areas. 
With pretest and posttest scores for both 
injured and control subjects, these latter 
authors report obvious loss in verbal and 
vocabulary scores some 10 years after focal 
lesions involving the left parietal-temporal 
areas, and/or with aphasia as a symptom. 
For other focal lesions, including frontal lobe 
lesions, they report little or no loss in vocab- 
ulary and similar tests. 
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In regard to the more immediate question 
of test format and test procedure, vocabulary 
deficit may well be in part a function of the 
vocabulary test used. In the previously cited 
study, Weinstein and Teuber (1957) found 
little verbal loss in certain brain injured 
groups; but these same subjects showed 
obvious loss on a hidden picture test. Is the 
multiple-choice test something like a hidden 
picture test, with the correct answer word 
“perceptually hidden” to some degree among 
three or four other words? Again, in the 
Smith and Kinder (1959) study, the Capps 
Homograph Test showed significant loss al- 
though oral vocabulary did not. The Homo- 
graph Test is a vocabulary test which intro- 
duces an additional element, requiring the 
subject to make a conceptual shift from one 
definition to another for the same word. 

The present study used three test formats 
with subjects from the same sample studied 
by Smith and Kinder (1959): one oral, and 
two multiple-choice, tests of vocabulary. The 
first multiple-choice procedure maximized the 
factor of sustained attention. The second 
multiple-choice procedure minimized the fac- 
tor of attention. The element of perceptual 
organization was assumed to be inherent in 
both multiple-choice procedures. 

Since the subjects had taken Wechsler oral 
Vocabulary tests before surgery and again 
8 years after surgery, the chief interest in the 
present study was in the new multiple-choice 
procedure which was designed to require both 
sustained attention and perceptual organiza- 
tion. The same words used in this multiple- 
choice procedure were then given orally and 
again under modified multiple-choice condi- 
tions to determine the comparability of these 
words to the Wechsler words, and to explore 
the extent to which each subject could achieve 
an approximate definition under any condi- 
tions. Three hypotheses were considered: 


1. Topectomized schizophrenics will show 
significant loss on a multiple-choice test of 
vocabulary which maximizes the importance 
of sustained attention. 

2. Topectomized schizophrenics will show 
no significant loss when the same words are 
then given as an oral vocabulary test. Smith 
and Kinder (1959) had already reported no 
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loss in oral vocabulary for these subjects 8 
years after surgery. The present oral test was 
designed to insure that the specific word list 
used did not account for a difference in 
results. 

3. Topectomized schizophrenics will show 
a tendency to loss when the same words are 
given a third time in a multiple-choice pro- 
cedure minimizing the importance of atten- 
tion. The number of correct definitions was 
expected to be greater than on the first 
multiple-choice procedure and less than on 
the oral procedure. 


METHOD 


Subjects. The subjects and surgical procedures 
have been described in detail elsewhere (Lewis, 
Landis, & King, 1956; Smith & Kinder, 1959). The 
40 subjects included in the present study were all 
patients at Rockland State Hospital, Orangeburg, 
New York. Of these 21 were operated subjects, and 
19 nonoperated controls. All were from the New 
York State Brain Research Project, had been diag- 
nosed as schizophrenic, and had been included in 
the study reported by Smith and Kinder. 

The operations had been performed 10 years prior 
to the present study. The subjects had been origi- 
nally divided into a C group of older subjects (age 
range 47-58 as of 1949) and a D group of younger 
subjects (age range 1949). Surgery con- 
sisted of an orbital or a superior topectomy, both 
bilateral frontal lobe operations. 

Thirty-eight were assigned to 19 pairs 
of operated and control patients, matched for pre- 
operative Wechsler Vocabulary. In 15 pairs, it was 
possible to match the subjects by chronological age 
group. In the other 4 pairs, they were matched for 
Vocabulary but not for age 

Tests. In addition to the Wechsler Vocabulary 
before and 8 years after surgery, the 40 subjects 
were given three new vocabulary tests 10 years 
after surgery. The five tests, with the designation 
used in Table 1, were as follows: 


21-38 in 


subjects 


1. Pre: The preoperative vocabulary score is the 
average of two Wechsler oral Vocabulary raw scores 
obtained within a period of a few months before 
surgery in 1949. On occasion, only one Wechsler 
was given before surgery, rather than the usual two 

2. Post-8: The average of two Wechsler oral 
Vocabulary tests given within a 30-day interval 
8 years after surgery 

3. Oral: The 60 words in the first civilian edition 
of the Army General Classification Test (AGCT, 
1947) were given orally and scored as in the 
Wechsler. 

4. MC-1: The first multiple-choice procedure 
consisted of the same 60 items from the AGCT. 
The stem phrase was presented on a card for 5 
seconds: for example, “BIG means the same as... .” 
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The stem card was then removed, and a 10-second 
delay followed. The four answer words from the 
AGCT were then presented, without the stem card, 
and the subject marked his response. The delay 
procedure emphasized the necessity for sustained 
attention. 

MC-2: The second multiple-choice procedure 
employed the same materials. The four answer 
words from the AGCT were presented first, and the 
subject read them aloud. Mistakes in reading were 
corrected. The stem card was then presented and 
read aloud by the examiner, with the answer words 
still in view. The subject then gave his answer 
orally. At every step, the examiner encouraged 
maximum attention to each element in the task. 
This procedure minimized the effect of variable 
attention. 


In the Oral, MC-1, and MC-2 procedures, a 
“base” and “ceiling” were used where obviously 
indicated; subjects responded to an average of some 
45 of the 60 items. A correction for chance suc- 
cesses was used. The order of the tests was: MC-1, 
Oral, MC-2, with MC-2 given in a second session. 
The advantages and disadvantages of a rotated order 
for the tests were considered. With the number of 
subjects available and the anticipated variability of 
psychotic behaviors, it was felt that exactly the same 
procedure should be used with all subjects, in order 
not to lose the precision in analysis offered by pair 
matching. As previously indicated, the oral and 
second multiple-choice procedures were designed as 
control observations. The unexpected results with 
the second multiple-choice procedure will be dis- 
cussed later. 

Analysis. Results were analyzed by the sign test, 
and by ¢ tests for differences between correlated 
means. p values refer to a one-tailed test.? 


RESULTS 


Table 1 presents the results for the five 
vocabulary tests in terms of the number of 
controls surpassing topectomized patients, for 
19 pairs of subjects matched on preoperative 
Wechsler Vocabulary. Significant values for 
the sign test are indicated. The results will 
be presented under the following headings: 
MC-1, Oral, MC-2, Wechsler, and the three 
hypotheses. 

MC-1. The first multiple-choice procedure 
incorporated both attention and perceptual 
organization, as well as word knowledge. It 
was expected that this procedure might dif- 
ferentiate the topectomized and _ control 


2 Ardie Lubin, and Elizabeth Engle of the Walter 
Reed Army Institute of Research have provided the 
authors with detailed statistical analysis of all the 
major comparisons reported. In all statistics re- 
ported, their results confirm the figures contained 
in this paper. 


TABLE 1 


NuMBER OF CONTROLS SURPASSING TOPECTOMIZED 
PATIENTS ON FIVE VOCABULARY SCORES, FOR NINE 
TEEN Pairs OF SUBJECTS MATCHED ON PRI 

OPERATIVE WECHSLER VOCABULARY 


Pre Post-8 MC-1 Oral MC-2 


14* > 13° 16** 
0 2 i 0 


Pre = Preoperative Wechsler 
Post-8 =Wechsler (1957) 
MC-1 =Multiple-choice, wit! 
Oral =New oral vocabulary 
MC.-2 =Multiple-choice, no 


*p =.05. 
*p =.01. 


1949 


lelay (1959 


1959) 


lelay (1959 


groups. It failed to do so. Neither the sign 
test (Table 1) nor the difference between 
means (¢ = 1.41) is significant. As with every 
vocabulary test given after surgery, the con- 
sistent tendency for the controls to surpass 
the operates is apparent. The mean superi- 
ority of the controls is 4.3 words. 

Oral. Since Smith and Kinder (1959) 
reported no significant difference in oral 
vocabulary for these subjects, it was expected 
that the new Oral vocabulary test would not 
differentiate the topectomized and control 
groups. This expectation was further encour- 
aged by the high correlation between the new 
Oral and Wechsler oral given 2 years earlier: 
for 41 subjects, this correlation was .89. For 
19 pairs matched on preoperative vocabulary, 
13 pairs show Oral scores higher for the 
control subject, 5 higher for the operate, and 
1 tie. The sign test is significant at the .05 
level, suggesting some loss in oral vocabulary 
due to brain surgery. The ¢ for the difference 
between means, however, is 1.29 and not 
significant. The average superiority of the 
control over the operate is 2.3 words. 

MC-2. The second multiple-choice pro- 
cedure minimized the importance of sustained 
attention but retained the factor of perceptual 
organization. This vocabulary test format 
yielded a clearly significant difference associ- 
ated with brain surgery. Of 19 controls, 16 
surpassed the topectomized subject with 
whom they were paired. The sign test is sig- 
nificant at the .01 level. The ¢ for the dif- 
ference between means is 3.21, also significant 
at the .01 level. The mean superiority of the 
controls over the operates is 6.3 words— 
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nearly three times as much as for the same 
words given orally.® 

The serial order in which the tests were 
given could be important: MC-1, Oral, MC-2. 
The control subjects showed a consistent in- 
crease in mean score on each later procedure: 
30.9, 35.6, and 38.1. The topectomized sub- 
jects obtained the following mean scores on 
the respective tests in order of presentation : 
26.6, 33.3, and 31.8. 

Wechsler. Smith and Kinder, using analysis 
of covariance, report no significant loss in 
Wechsler oral Vocabulary 8 years after 
surgery for these subjects. Pair matching, 
however, indicates significant loss. For 19 
pairs, the control surpasses the operate in 
14 cases. The sign test is significant at the 
.05 level. The ¢ for the difference between 
means is 2.46, also significant at the .05 
level. 

To confirm this evidence for loss in 
Wechsler vocabulary, the data on all 52 sub- 
jects used in the previous study were re- 
analyzed by pair matching. It was possible 
to match each of the 24 control subjects 
with a topectomized subject on preoperative 
Wechsler Vocabulary. Inspection indicates 
that the 4 remaining operated subjects ob- 
tained comparable scores. The sign test 
(with the control surpassing the operate in 
18 pairs) and the difference between means 
(t= 2.42) are both significant at the .05 
level. The sign test just misses significance 
at the .01 level (p = .011). The relative mean 
loss due to brain surgery is 2.6 words. 

Hypotheses. Results for the three hy- 
potheses follow: 


1. The topectomized schizophrenics do not 
show a significant degree of relative loss on 
the multiple-choice vocabulary test requiring 
both sustained attention and perceptual 
organization. As with all the vocabulary tests 
given after surgery, the consistent tendency 
for the controls to surpass the operates is 
suggested. 


3 The gain from Oral to MC-2, for the 19 pairs 
of subjects, was greater for the control 15 times, 
for the operate 3 times, with 1 tie. This manipula- 
tion of scores has questionable justification, but does 
suggest the effect of different test formats on the 
same words. 
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2. The topectomized subjects do show 
significant loss when the same words are then 
given orally. The sign test for the new Oral 
vocabulary indicates a difference in favor of 
the controls at the .05 level of confidence. 
The superiority of the controls need not be 
associated with the serial order in which the 
tests were given in the present study, since 
results for the Wechsler oral Vocabulary, 
given 2 years earlier, are comparable. Both 
the sign test and difference between means is 
significant at the .05 level. 

3. The multiple-choice test which mini- 
mized the necessity for sustained attention 
clearly differentiates the operates from the 
controls. This test was given last in all cases. 
The topectomized schizophrenics show more 
loss, and more significant loss, in vocabulary 
on this test than on any of the others. The 
differential effect of ‘test procedure” is indi- 
cated, but both format and order of 
presentation may contribute to the result. 


test 


DISCUSSION 


The present study considered specifically 
the effect of test format on vocabulary scores 
after brain surgery. The results do suggest 
that test procedure may be an important 
factor. However, another factor in test pro- 
cedure, the order of test presentation, may 
influence the results as much as the test 
format. This Discussion will consider briefly 
the general results of the study, and will then 
comment on the special question of test 
procedure. 

The evidence for some loss in vocabulary 
in topectomized patients, as compared with 
controls, 10 years after surgery is not sur- 
prising. The vocabulary losses reported here 
are small in terms of the instruments’ pre- 
cision: the relative loss in Wechsler Vocabu- 
lary is some 2.5 words. The number of 
studies, employing comparable controls, with 
which these results can be compared is ex- 
ceedingly small. Only a few reports present: 
control subjects acceptably matched with 
experimental subjects on the basis of tests 
before brain damage; comparable pretest and 
posttest scores, especially after the lapse of 
a number of years; and any reasonably ade- 
quate evidence for the location of the lesion. 
Perhaps the most nearly comparable study is 
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that of Weinstein and Teuber (1957) previ- 
ously cited. These authors reported marked 
loss on verbal and vocabulary tests some 10 
years after certain focal brain injuries, but 
little or no loss on such tests when only the 
frontal lobes were involved. The table which 
they present does indicate some tendency to 
relative loss for every injury group con- 
sidered, whatever area of the brain was in- 
volved. That is, some injury groups gained 
over their preinjury scores, but the gain was 
never as great as the gain made by the con- 
trol subjects. The present results are similar: 
slight, but in this case sigiificant, loss in 
vocabulary 8 and 10 years after topectomies 
involving frontal lobe areas. Since Weinstein 
and Teuber (1957) report on a group test, 
on nonpsychotic subjects, and on injuries 
rather than bilateral operations, their study 
is only roughly comparable to the present 
one. The striking loss in verbal skills reported 
by Williams et al. (1959) must be evaluated 
in light of the heterogeneous brain conditions 
involved and the relatively short length of 
time following recovery. The evidence sug- 
gests that vocabulary deficit some 10 years 
after limited frontal lobe lesions is not great, 
but may be statistically significant under 
conditions such as those of the present study. 

The effect of a specific test procedure em- 
ployed in measuring vocabulary deficit has 
been given only incidental attention in previ- 
ous studies. In one sense, the present results 
are clear. These topectomized subjects tend 
to show some deficit in vocabulary on every 
vocabulary test given 8 or 10 years after 
surgery. The effects of surgery are most 
clearly apparent, however, on the multiple- 
choice procedure MC-2, which was given last 
in order of testing. The effect of test proce- 
dure is indicated, but either test format or 
order of test presentation could account for 
the results. 

In the present study, the effect of format 
and of order may be closely related. This 
suggestion calls for a final word on the special 
problem of demonstrating effects due to 
surgery when both experimental and control 
subjects are markedly psychotic. For such a 
purpose, the most suitable test may be one 
which minimizes the capricious and unpre- 
dictable vagaries of schizophrenic behavior, 


and yet retains some element related to brain 
damage. The MC-2 procedure may have been 
most effective for this reason. Each element 
in the task was presented separately, and 
every effort was made to insure active at- 
tention. Reading errors were corrected, and 
the subject reviewed each correction. When 
the crucial answer was finally permitted, the 
schizophrenic patient’s sporadic or “func- 
tional” inattention to reality may have been 
largely overcome. The resulting optimal 
capacity to define words and deal with per- 
ceptual organization could then constitute a 
measure which would demonstrate the effects 
of surgery. The importance of simplifying 
elements, insuring attention, and using re- 
peated measures should be_ considered 
Paradoxically, such precautions might result 
in a measure of some aspect of “attention” 
which would reflect the effects of 
surgery. 

The most discriminative test, MC-2, was 
given /ast to all subjects, after they had had 
repeated exposure to the words involved. Of 
the two oral tests, the one with which the 
subjects had had previous experience dis- 
criminated best. Sheer’s (cf. Lewis, Landis, 
& King, 1956) astute observations on “prac- 
tice gains” are related to the importance of 
driving each element in the task home before 
expecting schizophrenic subjects to respond 
in a sufficiently comparable manner to per- 
mit measurement of differences. That is, 
familiarity with a test may serve to decrease 
unpredictable and irrelevant schizophrenic 
behaviors. Unless the idiosyncratic capri- 
ciousness of schizophrenic behavior is taken 
into account, relevant differences 
readily obscured. 


brain 


may be 


SUMMARY 


The study considered the effect of test pro- 
cedure on measures of vocabulary deficit § 
and 10 years after bilateral topectomy. Three 
vocabulary tests, each employing the same 
words, were given to 21 operates and 19 
controls. The three tests were a multiple- 
choice test maximizing the necessity for sus- 
tained attention, an oral test, and a multiple- 
choice test minimizing the importance of 
attention. The results and conclusions follow: 
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1. The topectomized schizophrenics showed 
a consistent tendency to vocabulary deficit on 
all tests, either oral or multiple-choice. The 
loss was statistically significant in most cases, 
but slight: for example, a mean loss of some 
two and a half words in Wechsler Vocabulary. 

2. Vocabulary deficit was greater, and 
most significant, on the multiple-choice pro- 
cedure which minimized attention, and which 
was given after the other two tests. This pro- 
cedure differentiated the operates and con- 
trols at the .01 level of confidence. The mean 
superiority of the controls over the operates 
was nearly three times as many words as for 
the same words given orally. 

3. The results suggest that the test pro- 
cedure used may be an important factor in 
studies of vocabulary deficit. Relevant con- 
siderations include both the test format and 
the order of presentation. Both format and 
order may contribute to the results of the 
present study. 
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AN EVALUATION OF THE 


NORTHWESTERN INFANT 


INTELLIGENCE TEST, TEST B 


BERNARD B 


Onondaga (¢ 


The Northwestern Infant Intelligence Test, 
Test B (Gilliland, 1951) contains 40 items 
and is designed for use with infants between 
the ages of 13 and 36 weeks. Preliminary sta- 
tistical work with the test (Gilliland, 1951, 
p. 16) suggests that it may be a more suit- 
able instrument for the assessment of infant 
intelligence than other tests for this age pe- 
riod. This paper attempts to provide a sys- 
tematic assessment of reliability and validity 
of the test as well as other quantitative and 
qualitative considerations. 


METHOD 


The subjects consisted of 100 adoptive or boarding 
home babies of both sexes between the ages of 13 and 
36 weeks inclusive. In the first phase of the study 
the Cattell Infant Intelligence Test (Cattell, 1947) 
was administered to the baby by a qualified ex- 
aminer.- The second phase occurred 3 days later 
when the Northwestern Infant Intelligence Test, 
Test B was administered to the baby by a different 
examiner. The third phase involved the readminis- 
tration of the Cattell by the original examiner * when 
the baby was 18 months of age.® 


The Child and Family Service and the Depart 
ment of Public Welfare, Children’s Division, both of 
Syracuse, New York were the participating agencies 
in the study. 

* Deep appreciation is extended to Marilyn Roths- 
child for her crucial part in this project 

8 The author administered all of the Northwestern 
tests 

‘ After 45 retests the original examiner left the em- 
ploy of the clinic. The author administered the re- 
maining 19 tests. 

5 The decision to retest the babies at 18 months 
with the Cattell was determined by two factors: (a) 
The correlation coefficient between the Cattell at 18 
months and the Stanford-Binet (Form L) at 3 years 
was .67 (Cattell, 1947, p. 49). This coefficient sug- 
gests that the Cattell at 18 months appears to be 
measuring the same factor or factors as the Stanford- 
Binet at 3 years. (b) The results of this study were 
to have definite practical implications regarding the 


BRAEN 


ounty Child Guidance Center 


\ frequency distribution of age by week for th 
abies the Northwestern is shown it 
Table 1. The mean age was 22.56 weeks with a 
ard deviation of 6.71 


! administered 


stand 
weeks 
RESULTS 

The mean IQ for the Northwestern was 
91.42 with a standard deviation of 9.54. The 
mean IQ for the Cattell was 112.00 with a 


TABLE 1 


FREQUENCY DISTRIBUTION OF AGE BY WEEK 
NORTHWESTERN-TES 


FOR 
t B SuBjects 


V = 100 


Age in Weeks I 


new 
aoa x 


16 6 
17 5 
18 } 
19 5 
20 9 
21 2 
22 3 
23 4 
24 6 
25 4 
26 1 
27 9 
2 2 
29 2 
30 4 
31 3 
32 2 
33 3 
34 3 
35 5 
36 0 





adoption testing program at the Child Guidance Cen 
ter. For this reason it was important to gather vali- 
dating data with expediency but without sacrificing 
rigor. 
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TABLE 2 


MEANS 


AND STANDARD DEVIATIONS FOR NORTHWESTERN 


AND CATTELL STANDARD 


Score IQs ror Four AGE Groups AND ToTaL Group 


Northwestern 
Age in 
Weeks N Vf SD 


13-16 24 99.50 
17-21 25 97.60 
22-27 27 103.89 
28-35 24 100.38 
13-36 100 100.42 


* Significant at the .05 level 


standard deviation of 15.21. On the basis of 
these findings it appears that the unequal 
means and standard deviations obtained with 
these tests make any direct comparison of IQs 
impossible. Further, without some sophistica- 
tion with the concept of variability there may 
be a tendency to misinterpret the meaning of 
the IQ obtained with either test. In order to 
make comparisons between the two sets of 
scores, all the IQs on the Northwestern and 
Cattell were converted to standard scores with 
a mean of 100 and a standard deviation of 10. 
Also the 100 babies were divided into four age 
groups with approximately an equal number 
of subjects in each age group. 

The converted means and standard devia- 
tions for the four age groups and the total 
group appear in Table 2. 

From Table 2 it appears that variability of 
IQ from one test to the other occurs in all 
four age periods but the ¢ tests reveal that it 
is most pronounced at the 13-16, 17-21, and 
28-35 week levels. The ¢ for the 28-35 week 
old group is artificially elevated due to the 
low ceiling on the Northwestern. By the IQ 
calculation method described by Gilliland 
(1951, p. 14) an infant of 36 weeks can only 
achieve an IQ of 112 on the Northwestern if 
he passes all 40 items. An infant of 35 weeks 
can only achieve an IQ of 115 if he passes all 
40 items, and so on. Such a state of affairs 
could serve to lower artificially the mean IQ 
on the Northwestern for the 28-35 week old 
group, which would then contribute to a de- 
ceptive difference between the means of the 
Northwestern and Cattell. 

The means and standard deviations of the 
difference between the Northwestern and Cat- 


Cattell 
— Mean 
M Difference 


103.17 3.67 
93.96 3.64 
102.59 1.30 
104.88 50 
100.00 42 


tell standard score IQs for the 
riods are reported in Table 3. 

It appears from these data that the mean 
difference in standard scores for each group 
and total age period is no less than 6 and no 
more than 10 units. However, the standard 
deviations for each age period suggest that in- 
dividual infants can vary from no differen 
in relative position on the two tests to a dii 
ference of as much as 25 standard score units 
from one test to the other. Since only 3 days 
intervened between the Northwestern and Cat- 
tell testings, these data indicate that at this 
age level, examiner, subject, and test reliabil- 
ity are difficult to achieve. 


four age pe- 


Reliability 

The reliability of the Northwestern for the 
100 babies 13—36 weeks of age was assessed 
through the odd-even method. The resulting 
coefficient corrected by the Spearman-Brown 
formula was .95 = .01. 

Table 4 shows the odd-even reliability co- 
efficients for the four age periods. 

Gilliland (1951, p. 16) reported a corrected 
odd-even coefficient of .80 computed from 


TABLE 3 


MEANS AND STANDARD 
ENCES BETWEEN 
STANDARD 


DIFFER 
CATTELL 
PERIODS 


DEVIATIONS OF THEI 
NORTHWESTERN AND 
ScorE IQs ror Four AGE 


Age in 
Weeks 


Range 


13-16 
17-21 
22-27 
8-35 





Vorthwestern Infant Intelligence Test 


rABLE 4 


NORTHWESTERN Opp-EveN RELIABILITY 
CIENTS FOR Four AGE PERIOD 


\ge in 


Weeks 


data obtained from babies 
and 36 weeks of age 
Cattell (1947, p. 49) 
cients for her test for age periods comparable 
to the Northwestern age levels. At the very 
early age levels (13 weeks and 3 months) 
both the and Cattell are less 
reliable than at the later levels, but in rela- 
tion to the Cattell at the early age period the 
Northwestern 


strument. 


214 between 13 
from the Chicago area. 
reports odd-even coeffi- 


Northwestern 


appears the more reliable in- 

In spite of the fact that the Northwestern 
permits a reliable representation of particular 
skills during the testing session, the obtained 
coefficients do not give information regarding 
consistency of performance over time. It would 
ippear then that the reliability of the test for 
predictive purposes is limited 

The reliability coefficients for each of the 
four age groups may in themselves be unre- 


liable because of the small number of subjects 


at each level. However, since the progression 
of coefficients followed closely those obtained 
with the Cattell it appeared that this factor 
was not significantly affecting the resulting 
coefficients. 


Correlational Analysis 


The between the 
Northwestern, Test B and the Cattell for the 
four age groups and the total group are re- 


correlation coefficients 


ported in Table 5. 
that there is an 
element of communality in the tests and that 


These coefficients suggest 


this communal aspect is apparent for all ages. 
The magnitude of the correlation coefficients 
remains about the same for each age period 
even though the reliability of both tests in- 
creases with age. Such a finding suggests that 


the tests have more in common at the 13-21 
week period than the 22-36 week period. 

The correlation coefficient of .58 between 
the Northwestern and Cattell for the whole 
age range may not reflect realistically the re- 
lationship between performance on the two 
tests because of the low ceiling on the North- 
western already described. In order to evalu- 
ate the effects of this artifact, the correlation 
was redone with the 28-35 week old group 
eliminated. The resulting coefficient was .62 

.05. It does not appear that the elimina- 
tion of the oldest group markedly affects the 
extent of the relation between performance 
on the two tests. 


Sex Diffe rences 


A critical ratio was computed between the 
means for 61 males and 39 females on the 
Northwestern. The resulting ratio of 1.55 in- 
dicates that the obtained differences between 
the means of the sexes could be accounted for 
by chance. 


Validity 


Validity was assessed by correlating per- 
formance of 64 subjects on both the Cattell 
and Northwestern at 13 to 36 weeks with 
their performance at 18 months of age on the 
Cattell. Table 6 shows the means and stand- 
ard deviations for the Northwestern and Cat- 
tell IQs at 13 to 36 weeks and the Cattell IQs 
at 18 months. 

The correlation coefficient between the 
Northwestern IQs at 13 to 36 weeks and the 
Cattell at 07. The co- 
efficient between the Cattell at 13 to 36 weeks 
and the Cattell at 7 


18 months was .38 


18 months was .39 + .07. 
Both coefficients are low enough to indicate 


TABLE 5 


CORRELATION COEFFICIENTS BETWEEN NORTHWESTERN 
AND CATTELL IQs For Four AGE Groups AND TOTAI 
GROUP 
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TABLE 6 


MEANS AND STANDARD DEviATIONS FOR NORTH 
WESTERN AND CATTELL IQs at 13-36 WEEKS AND 
CATTELL IQs at 18 MontTHsS 


Test N M SD 
Northwestern (13-36 weeks) 
Cattell (13-36 weeks) 
Cattell (18 months) 


90.91 
111.92 
110.58 


9.38 
16.91 
10.65 


that little faith can be placed in predictive 
statements regarding intelligence for the 13 
to 36 weeks age group. In fact only about 
14% of the variance on the Cattell at 18 
months can be accounted for by variation in 
the Northwestern and Cattell IQs at 13 to 36 
weeks. At the 13 to 36 week period, factors 
such as rapid and varying growth rates, em- 
phasis on sensorimotor skills, varying motiva- 
tion, relatively subjective administration and 
scoring procedures, and examiner unreliability 
all seem pertinent sources of uncontrolled 
variance that serve to reduce the predictive 
power of both the Northwestern and Cattell. 


SUMMARY 


This study was designed to investigate the 
reliability, validity, and certain other features 
of the Northwestern Infant Intelligence Test, 
Test B. One hundred adoptive or boarding 
home babies between the ages of 13 and 36 
weeks were tested with the Cattell. Three days 
later the Northwestern was administered. The 
Cattell was readministered to 64 of the 100 
babies when they were 18 months of age. 


Bernard B. Braen 


The general findings were: 


1. The IQs obtained on the Northwestern 
and Cattell were not directly comparable due 
to unequal means and standard deviations. 

2. The Northwestern has a low ceiling at 
the upper age levels which prevents full ex- 
pression of an infant’s developmental! skills 
between the ages of 30 and 36 weeks. 

3. Odd-even reliability for the Northwest- 
ern and Cattell is similar for the 13 to 36 
week age period but the reliability for the 
Northwestern is somewhat better than the 
Cattell at the 13-16 week period. The reli- 
ability of both tests improved with increasing 
age. 

4. There appears to be a common factor in 
both tests at the 13 to 36 week period. This 
is especially true at the 13-21 week level 
probably due to a preponderance of sensori- 
motor items on both tests for this age. 

5. There was no significant 
found on the Northwestern. 

6. The validity results indicate that predic- 
tion of a child’s intelligence level at 18 months 
is a very risky procedure when based on per- 
formance on the Northwestern and Cattell at 
the 13-36 week period. 


sex difference 
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JUDGMENTS OF ADJUSTMENT FROM TAT STORIES 
AS A FUNCTION OF EXPERIMENTALLY 
ALTERED SETS 


BERNARD LUBIN ! 


Indiana University Medical Center 


Several recent investigations have demon- 
strated the susceptibility of the projective 
techniques to situational influences (Masling, 
1960). The arousal of temporary motivational 
states (Mussen & Scodel, 1955), the relation- 
ship between the subject and the examiner 
(Bernstein, 1956), the subject’s attitude to- 
ward the test (Feldman & Graley, 1954), and 
the manner in which the test is defined (Henry 
& Rotter, 1956), produce identifiable changes 
in projective test response. These studies usu- 
ally end at the point of measuring changes in 
the protocols; the implication seems to be that 
the modified protocols would influence subse- 
quent clinical judgment. 

The clinical psychologist usually works in 
a setting in which practical decisions having 
great consequences for the individual and for 
the community must be made. Often, the re- 
ferrer requests an opinion as to an individual’s 
“adjustment.” Since the question rarely speci- 
fies a Criterion situation for the judgment and 
since the various objective measures of ad- 
justment are fairly specific (Tindall, 1955), 
the judgment is frequently based on a global 
estimate of intellectual and/or “emotional” 
controls. Psychoanalytic formulations continue 
to influence much of clinical practice; thus 
emotional control often is taken to mean con- 
trol of sexual and aggressive drives, and in- 
ferences about psychological status are based 
upon the degree and manner in which drive 
derivatives enter consciousness and behavior. 

The purpose of this investigation is to in- 
quire into some of the determinants of global 
judgments of adjustment. Specifically, the ef- 
fect of TAT stories which have been elicited 


' T wish to thank James Norton for 


rroblems of design and analysis 


issistance with 


under different instructions are judged for 
level of adjustment by clinical psychologists. 
Stories elicited under “Facilitating” instruc- 
tions (prestige suggestion which placed high 
value on spontaneity and individuality) have 
already been shown to contain a higher amount 
of sexual and aggressive expression than sto- 
ries elicited under “Neutral” or “Inhibiting” 
(prestige suggestion which placed high value 
on constraint and conformity) instructions 
(Lubin, 1960). The expectation is, therefore, 
that from the Facilitating condition 
will be judged as lower in adjustment than 
stories from either the Neutral or the In- 
hibited condition. 


stories 


METHOD 


In a previous investigation (Lubin, 1960), a ran 
dom sample of 60 male college freshmen was tested 
covariance design. The five conditions 
Card conditions TAT cards with 
high pull of sexual content and two TAT cards with 
high pull of aggressive content), and three levels of 
Instructional Set (Inhibiting, Neutral, and Facilitat 
ing). Set was produced by means of prestige sug- 
gestion: the Facilitated group was told that “nor- 
mal,” “well-adjusted” people tend to let their im 
agination go as it is stimulated by the cards; the 
Inhibited group was told that normal, well-adjusted 
people are the master of their imagination and emo- 
tions; and the Neutral group was given 
instructions.* Subjects randomly assigned to 
conditions, 10 subjects to a condition. Analyses indi- 
cated that Set produced a significant effect on sexual 
and on aggressive expression, and that the interac- 
tion between Set and Cards produced a 
effect on aggressive expression 


in a 3 


were two (two 


innocuous 
were 


significant 


2A more detailed description of the methodology 
including the complete instructions can be found in 
the report of the previous investigation 
1960) 


(Lubin, 
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Stimuli 


The two experimentally treated TAT stories pro- 
duced by each of the 60 subjects were stapled to 
gether. The stories were verbatim transcriptions and 
contained a notation as to reaction time. Within each 
envelope, the 120 stories (60 pairs) were divided into 
two sets of 30 pairs each, 60 stories told to TAT 
Cards 2 and 10, and 60 stories told to Cards 8BM 
and 20. The paired stories within each set were thor- 
oughly scrambled 


Judges 


Ten clinical psychologists, seven of whom 
sessed the PhD in clinical psychology and three who 
had a master’s degree and at least 2 years of addi- 


tional clinical experience, participated as judges 


pos- 


Procedure 


In the instructions, the judges were told that the 
stories were produced by 60 male subjects between 
the ages of 18 and 23. They were informed as to the 
TAT cards which had elicited the stories, and they 
were requested to rank the four major cues which 
they had used in rating the stories. Each judge 
received an envelope whose contents are described 
above. In addition, each envelope contained the fol- 
lowing seven-point scale together with detailed in- 
structions for its use.* 

1. Well-integrated, happy person, socially effective 

2. Only mild problems in essentially well-function- 

ing person 

Particular problems of some difficulty but social 

effectiveness maintained 

Discomfort from problems severe enough to re- 

quire therapy but ability to carry on 

Acute neurotic problems but 

tenuously preserved 


reality contact 

. Severe neurotic problems with disorganization 
Severe disturbance 
prepsychotic 


bordering on psychotic or 


The seven-point scale was adapted from one de- 
veloped and used by Dymond (1954) in an investi- 
gation of the effects of psychotherapy. Repeat~scor- 
ing reliability of this scale was found to be .94 

The two stories of each subject were assigned one 
rating by each of the 10 judges. Analysis of variance 
was conducted on the 600 ratings of adjustment 


RESULTS AND DISCUSSION 


It is important to note that the validity or 
accuracy of the ratings was not a subject of 


>A copy of the complete instructions has been de- 
posited with the American Documentation Institute 
Order Document No. 6631 from ADI Auxiliary Pub- 
lications Project, Photoduplication Service, Library 
of Congress; Washington 25, D. C., remitting in ad- 
vance $1.25 for microfilm or $1.25 for photocopies 
Make checks payable to: Chief, 
Service, Library of Congress. 


Photoduplication 


investigation. The stories had been produced 
by a sample of 60 male freshmen chosen ran- 
domly from the total population of male 
freshmen of a large university. It can be as- 
sumed that these subjects represented a sam- 
ple of relatively well-functioning individuals. 
In order to preclude a skewed distribution of 
ratings, the judges were not told that the sub- 
jects were college students. However, neither 
were they told that the subjects represented 
a malfunctioning group, such as hospital or 
clinic patients. these conditions, 


Under one 


would expect that the effects of the stories 
themselves would be maximized. 

The scatter plot revealed that the 600 rat- 
ings made by the 10 judges were distributed 
in a normal fashion over the seven scale points. 


Table 1 presents the summary of the analy- 
sis of variance of the ratings of adjustment. 
It can be seen that the Instructional Set un- 
der which the TAT stories were elicited origi- 
nally produced a significant effect on the judg- 
ments of adjustment. Neither Cards nor the 
interaction Letween Set and Cards was found 
to produce a significant effect on judgments 
of adjustment. Those aspects of the judgments 
determined by the sexual or aggressive stimu- 
lus patterns of the cards seemed to produce 
an effect of equivalent magnitude. 

It seems reasonable to conclude, therefore, 
that the subject’s attitude toward the testing 
situation can influence not only the test proto 
cols (Lubin, 1960) but also may produce an 
effect on resulting clinical judgment. 

Further information concerning the effect 
of Instructional Set upon judgments of ad- 
justment is presented in Table 2. 
was 


Since the 
so that higher scores 
represent judgments of poorer adjustment, the 
order of the means indicates that highest level 
of adjustment was rated for stories from the 
Inhibiting condition, lowest from 
the Facilitating condition, with stories from 
the Neutral condition occupying the inter- 
mediate position. The Tukey Studentized 
Range Test that the difference be- 
tween the means of the Inhibiting and Facili- 
tating conditions is 


scale constructed 


for stories 


reveals 


large enough to be ac- 
cepted with confidence, but that we cannot 
safely say where the Neutral condition falls 


in between the other two. 





Judgments of Adjustment from TAT Stories 


rABLE 1 


ANCE OF 


Bet wee 
Between ( aras 
set X ( 
Subjects Within Set 


Totals 


n Sets 


rd 
irds 


Although the lists of Story cues upon which 
the judges based their ratings were too varied 
to permit meaningful categorization and analy- 
sis, 8 of the 10 judges mentioned “amount of 
the four main 
cues which they used. The previous investiga- 
tion (Lubin, 1960) that In- 
structional Set produced a significant effect 
on both sex and aggressive expression and 
further analysis of the effect indicated that 
the effect was linear, i.e., highest expression 
of sex and aggression in the Facilitating con- 
dition, lowest in the Inhibiting condition, and 
an intermediate position for the Neutral con- 
dition. Thus there is a strong suggestion that 
judgments of adjustment based on TAT sto- 
ries are significantly influenced by the amount 


sex or aggression” as one of 


demonstrated 


of sex and aggression which is expressed. 


These findings support the observations of 
Soskin (1954) that when the psy- 
chologist is requested to make interpretations 


clinical 
based on projective test protocols, his judg- 
the direction of 
pathology. Also, Kenny and Bijou (1953) 
found that when clinical psychologists were 


ments tend to be biased in 


asked to rank TAT stories according to their 
interpretive significance, they tended to give 
greater weight to contents which represented 
expressions of the sexual and aggressive drives. 

The findings of this investigation point to 
some of the risks involved in “‘blind analysis.” 
When the subject has a Set to be spontaneous 
and to individualize himself, he expresses more 
sex and aggression in his TAT stories (Lubin, 
1960), and when the clinical psychologist, 
without knowledge of the subject’s Set, makes 
a judgment of the subject’s adjustment based 
on these same TAT stories his judgment tends 
to be biased in the direction of pathology to 


RATINGS OF ADJUSTMENT 


the extent to which sex and aggression is ex- 
pressed in the stories. 

It might be objected that in practice the 
clinical psychologist does not make judgments 
of adjustment based solely on the TAT and 
certainly not on such a small number of cards 
as were used in this study. The first part of 
the objection must be granted; such an im- 
portant judgment would be based upon a bat- 
tery of tests rather than a single instrument 
It should be noted, however, that within the 
test battery, the TAT is likely to be used in 
a variety of ways (Dana, 1956). In addition 
interpretation of the TAT in the clinical set- 
ting is more often based on global judgment 
than on the few time consuming objective 
methods of analysis which have been proposed 


SUMMARY 


In order to test the hypothesis that judg 
ments of adjustment made by clinical psy- 
the 
which a subject takes a projective test, two 


chologists are influenced by Set under 
TAT stories from each of 60 subjects were 
rated on a seven-point scale of adjustment by 
10 clinical psychologists. The stories previ- 
ously had been elicited under differential In- 
structional Sets: Inhibiting (V = 20), Neu- 
tral (NV = 20), and Facilitating (A 20) 


rABLE 2 


Set 


Inhibiting 
Neutral 


Facilitating 
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It was found that Instructional Set signifi- 
cantly influenced the ratings: most pathology 
was rated for stories from the Facilitating 
condition and least for stories from the In- 
hibiting condition. Collateral data suggested 
that the amount of sexual and aggressive ex- 
pression in the stories was the most impor- 
tant factor which influenced the ratings. 
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ORTHOPEDIC DISABILITY AS A FACTOR 
IN HUMAN-FIGURE PERCEPTION 


AURELIA LEVI 


Teachers ( 


There is perennial interest in the possible re- 
lationship between personality dynamics and 
various features of human-figure drawing. 
Though experimental studies frequently fail 
to support such a relationship, the impression 
continues that it does nevertheless exist, and 
an occasional study showing positive results 
helps to keep the impression alive. 

Some of this ambiguity of result may be 
owing to the fact that even if it be true that 
a figure-drawing is a projection of its creator’s 
body image (Machover, 1949), the projective 
process is necessarily limited by perceptual 
limitations, including perceptual prejudices 
(Postman, Bruner, & McGinnies, 1948). In- 
deed, this is the heart of the basic projective 
hypothesis. It follows then that before we 
can proceed to evaluate the peculiarities of a 


drawing—its omissions, overemphases, distor- 
tions, or whatever- 
dynamics, we would do well to establish the 
prejudicial influence of these dynamics on his 
perceptions. In other words, even assuming 
that a drawing is a projection, before we can 
attribute a 


in terms of its creator’s 


bit of elaborate overemphasis in 
the drawing to its creator’s undue preoccupa- 
tion with the part in question, we have to in- 
sert a middle step and show that his percep- 
tion—as yet uncomplicated by the act of 
bodying forth mental images through paper 
and pencil—has already been shaped by that 
particular preoccupation. This point has been 
made concisely by Silverstein and Robinson 
(1956): “The assumption that the physical 
body, the body image, and the drawn figure 
are in isomorphic relation” remains as yet un- 
justified. From their study they conclude that 
“this one-to-one relationship does not seem to 
exist” (p. 340). 

It is only with the existence of prejudicial 


ollege, Columbia University 


influences on the perceptions of several diag- 
nostic groups that the present study is con- 
cerned. The final step, relating these percep- 
tions to the recreative process of figure-draw- 
ing, is beyond the scope of this study. Our 
task will be to separate Silverstein and Robin- 
son’s three isomorphs into two groups of two 
and investigate a hypothesized one-to-one re- 
lation between the first two, the physical body 
and perceptions of the human figure (the body 
image). Our diagnostic groups differ from 
each other with respect to orthopedic dis- 
ability—a comparatively simple, 
distinguishing criterion 

This study hypothesizes a one-to-one rela- 
tion between the physical disability and per- 
ceptions of the human figure, and that a par- 
ticular orthopedic disability acts as a dislo- 
cating influence on perceptions of the relevant 
body part. Specifically, it is hypothesized that 
subjects with disability of the legs will be un- 
usually sensitive to the legs in drawings of 
the human figure; and that subjects with dis- 
ability of the arms will be unusually sensitive 
to the arms in drawings. 

Since the group of back disabilities is a 
less homogeneous group than the other two, 
and has less well-defined etiologies which have 
been thought to be of psychic origin, it is fur- 
ther hypothesized that this group will show 
a greater resemblance to the characteristics of 
a control group of nondisabled. 


objec tive, 


PROCEDURE 

Subjects 

The experimental group was composed of 38 sub- 
jects with orthopedic disabilities as follows: (a) 12 
with traumatic disability (fractures, amputations) to 
the arms, and no other disability; (6) 13 with trau- 
matic disability to the legs, and no other disability ; 
(c) 13 with a variety of low-back disabilities (spinal 


253 





urelia Levi 


rABLE 1 


SEX, AND EpucaTion oF Co 
AND DISABLED SUBJECTS 


Age Education 


Subjects Range Range 
19—68 2-20 


16-67 2 20 


Control 
Disabled 


back; 
un- 


the lower 
bac k 


determined origin), and no other disability 


fusion; laminectomy; arthritis of 


uncomplicated, nonradiating low pain of 


The con 
trol group was composed of 35 subjects without any 


orthopedic disability. As shown in Table 1, the age 


of the experimental group ranged between 16 and 67, 
mean age 41.9, and of the control group between 19 


and 68, mean age 40.8. The number of vears of edu- 


cation for the experimental group ranged from 0 to 


20, mean 8.5 (median 9), and for the control group 


between 2 and 20, mean 9.0 (median 10) 


A } 


Units 


Score: L 
Score L 


Scere: B 


L 


Standard 


Score: A Score: B 


Fic. 1. Model figure, standard body units, 
examples of stick figures containing one 
unit each. 


and five 
standard 


Instrument 


The instrument was 
stick figures (af 


1955) 


a pair ol 
Hardyck, 
model stick figure 
of three parts: an ul 
and-spine unit. The three 
considered to be the 

each of the 54 paired figu 
ird unit, the 
ure 1). Each stick 
the left 
are so paired that 18 of 
unit, 18 contain the 
the standard he 
figures on one card cont 


and a 
All thi 
arm 

are 

other two 
figu 
once on and on 
arm 
contain 
irom an objective 
the 


model as ever 


exactly 


Sam 


Method 


Each subject was 
model figure, on the larg 
you other figures in 


out which one of each p 


or resembles the 
subject are the number 
each standard unit, the 
possible for any unit be 
For example, a fig 


the standa 


ing ( 
Arm score if 


standard arm unit; simi 


Back score, if 


standard head-and-spins 


rAl 


( OMPARISON 
GROU!I 


Croup 


4 Choices 
Group A 
Control 

L Choices 
Group L 
Contro!] 

B Choices 


Group B 
Control 


t test cor 
the 


unequal 


case |! 


Walker 


single large 


model mo 


the standa 


of 27 cards, each bearing 
1954; Sarbin & 


, 
card bearing a 


ter Sarbin, 


stick figures are composed 
l head 
parts making up the model 


tandard units of 


t, a leg unit, and a 


reterenc 

es contains only one stand 
(Fig 
twice in the set, 
right. The 
the standard 
unit, and 18 
unit, but 
iny identical unit 
view, 


inits being variants 


ippears 
the figures 


on 
them contain 


tandard leg 


and-spine no 
Thus, 


bears 


figure 
to the 


each 
resemblance 


follows 


“Here is a 
to st 


I am going 10W 


I want you to pick 
like the model, 


tor 


OKS more 


losely.” Scores each 


choices he made involving 
ximum number of choices 
18, and the minimum bx 
choice is credited to the 
the 


the 


1 unit it contains is 
is credited to 
it contains 


is the 


Figure 1 


LI 


F THREE Disa 
NTROL GRouP 
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TABLE 3 


COMPARISON OF VARIANCES OF 


GROUPS WITH 


THREE Disapiity 
GROUP AND 
with Eacu OTHER 


4 CONTROI 


Group 


\ Choices 
Group A 


( ontrol 


20.7 
20.29 
L Choices 

Group L 

Control 
B Choices 

Group B 


Group A 


I 


Group L 


Group B 


Group 


Group A 
Group B 


RESULTS 


As was hypothesized, the arm-disabled and 
leg-disabled groups show a perceptual sensi- 
tiveness that varies with the site of disability. 
For these two groups, means differ from those 
of nondisabled persons (Table 2), at a high 
degree of significance (.01 and .001, respec- 
tively). 

As was hypothesized, the group of back dis- 
abilities shows no such perceptual difference 
from the nondisabled. The mean B Score for 
the back-disabled group does not differ sig- 
nificantly from the control mean (Table 2). 

An unexpected finding is that the leg-dis- 
abled group is far more homogeneous than 
any of the others (Table 3). 


DISCUSSION 


The main finding of the study—that there 
is a one-to-one relationship between certain 
types of physical disability and perceptions 
of the human figure—provides a necessary, 
though not sufficient, step for a demonstra- 
tion that a figure-drawing is a projection of 
its creator’s body image. 
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That this one-to-one relationship is not a 
simple, across-the-board condition whose ex- 
istence can be assumed for every kind of 
physical idiosyncrasy may be seen from the 
fact that the group of back disabilities, whose 
origins and symptomatology are much less 
clearcut than those of the other two groups, 
shows a less distinct perceptual prejudice; 
and from the differing the 
groups. 

It may be that the degree to which the dis- 
ability is visually apparent immediately, ex- 
erts an important influence. Under such a 
scheme, low-back would be at the 
low end of a continuum of immediately ap- 
parent crippling; arm fractures and even am- 
putations of the hand or arm—especially when 
fitted with a cosmetic prosthesis—while be- 
coming apparent enough in any situation in- 
volving interaction, would nevertheless be 
more readily concealable than the impaired 
and distorted ambulation of the leg-disability 
group. Such an explanation might account for 
the findings with respect to the variability of 
the groups as well as their perceptual preju- 
dice. 


variability of 


sufferers 


It should be noted that the results of this 
study may possibly be dependent on the fact 
that the control group and all three experi- 
mental groups were fairly homogeneous with 
respect to age, education, and socioeconomi 
level. To the extent to which differences in 
these factors might exert competing influences 

as, for instance, the possibility that an ex- 
tremely gifted, high 
might bring into play an opposing percep 
tual prejudice akin to repression (Postman 
Bruner, & McGinnies, 1948)—the pattern of 
results might be expected to vary; such an 


(Achievement subject 


effect would, of course, be a matter for clari- 
fication through further research. 


SUMMARY 


This study compared three physical-disabil- 
ity groups with a group of nondisabled con- 
trols for their perceptual reactions to a struc- 
tured test involving resemblances among sche- 
matized human-figure drawings. 

It was ascertained that subjects with arm 
disability are particularly sensitive to the 
arms in a drawing, and that subjects with leg 
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disability are particularly sensitive to the legs 
in a drawing. Subjects with a variety of low- 
back ailments—often thought to be psycho- 
genic—appear to be closer to the nondisabled 
control in their reactions than do either of the 
other two groups. 


The group of leg disabilities is a much more 
homogeneous group than either the other two 
disabled groups or the controls. 


A possible explanation that would account 
for all these findings is offered on the basis of 
the degree to which the disability is instantly 
apparent. 

It is considered that this study gives sup- 
port to a hypothesized one-to-one relationship 
between the physical body and the body 
image. 


Aurelia Levi 


REFERENCES 


jection in the 
Springfield, IIL., 


MaAcHOver, Karen. Personality pp 
drawing of the human figure 
Charles C Thomas, 1949 

PostMaN, L., Bruner, G., 
sonal values as selective 
abnorm. soc. Psychol., 

SarBin, T. R. Role 
Handbook of 


& McGrwynies, E. Per 

factors in perception. J 

1948, 43, 142-154 

theory. In G. Lindzey 

ocial psychology. Vol. 1 
Mass.: Addison-Wesley, 1954. Pp. 223-258 

SarBin, T. R., & Harpycx, C. D. Conformance in 
role perception as a personality variable. J. consult 
Psychol., 1955, 19, 109-111 

Sttverstern, A. B., & Rosryson, H. A. The repre 
sentation of orthopedic disability in children’s fig- 
ure drawings. J. consult. Psychol., 1956, 20, 333 
341 

WaLker, Heten M., & Lev, J 
New York: Holt, 19 


(Ed.) 


Cambridge, 


Statistical inference 


(Received Ma 18, 196 





Journal of 


Consulting Psychology 
1961, Vol. 25, No. 3 57 


No. 3, 257-259 


TAT PERFORMANCE AS A FUNCTION OF ANXIETY 
AND COPING-AVOIDING BEHAVIOR ' 


E. JERRY PHARES 


Kansas State University 


A characteristic often associated with anx- 
lety is its potential for generalization. Based 
perhaps on either qualitative similarity or 
identical elements, anxiety may become at- 
tached not only to the original arousal situa- 
tion but also to other situations. 

One such situation is that of projective test- 
ing. Anxious patients frequently seem to pro- 
ject threat into the test stimuli. For example, 
Rotter (1940, 1946) has stated that anxiety 
on the TAT is revealed in plots that empha- 
size sudden physical accidents and emotional 
trauma. Similarly, Schwartz (1955), from a 
Freudian viewpoint, has related the expres- 
sion of castration anxiety on the TAT to 
themes in which occur genital or other body 
injury or loss, sexual or personal inadequacy, 
intrapsychic or extrapsychic threat, and loss 
of cathected objects. 

However, to expect every anxious patient 
to produce such themes would seem a too sim- 
ple approach to the problem. Not all anxious 
people handle their anxiety alike. Some sub- 
jects, having perceived a threatening stimu- 
lus, become evasive and produce bland sto- 
ries, while others respond directly and create 
themes indicative of threat. The distinction 
drawn here parallels that of the perceptual 
defense-sensitization dimension (Carpenter, 
Weiner, & Carpenter, 1956). 

In a similar vein, Weisskopf-Joelson, Asher, 
and Albrecht (1957) investigated “label- 
avoidance” as a manifestation of repression. 
They found some support for the hypothesis 
that people who strongly repress an impulse 

1 Based on a paper delivered at the Annual Meet- 
ing of the Midwestern Psychological Association, St 
Louis, Missouri, April 29, 1960 

This study was supported in part by a grant from 
the Faculty Research Fund of Kansas State Uni- 
versity 


tend to avoid expressing this impulse in situa- 
tions carrying its label to a high degree. Ap- 
plied to the TAT this could mean that sub- 
jects with strongly repressed sexual or aggres- 
sive impulses would frequently fail to give 
sexual or aggressive themes to pictures sug- 
gestive of an aggressive or sexual content. 
The hypothesis of the present study is that 
high anxious subjects will show greater pref- 
erence for TAT themes involving accident, 
threat, or trauma than will low anxious sub- 
jects when matched for tendency to evade or 
with threatening stimuli. The present 
study resembles one by Goodstein (1954) who 
found a nonsignificant relationship between 
anxiety and preference for anxiety-like TAT 
statements without 
avoiding tendencies 


( ope 


controlling for coping- 


METHOD 


Selection of High and Low Anxious Subjects 


The Taylor anxiety (1953) was 
anxious subjects. From 
263 general psychology students who took a 50 item 
form of the A scale during pre-enrollment, 25 high 
anxious 


scale (A scale) 


used to select high and low 


2 


and 25 low anxious fe 
males (scores under 7) were selected 


(scores above 18) 


Determination of Coping-Avoiding Tendencies 


This technique stems from a distinction between 
copers and avoiders recently made by Mainord 
(1956). He demonstrated that copers recalled more 
nonsense syllables associated with disturbing words 
than with neutral words, and avoiders recalled more 
syllables associated with neutral words. Goldstein 
(1959) utilized the same distinction in predicting dif- 
ferential responses to fear-arousing propaganda. Both 
of these investigations used an incomplete sentences 
blank (ISB) consisting of 40 critical and 20 filler 
stems. The former have direct sexual and aggressive 
implications. Critical items are scored on a three- 
point scale in terms of specificity, strength, and arbi 


trariness of response. A subject’s score is the sum of 
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a high score 
indicates coping tendencies. Copers are thus subjects 
who respond most directly to the implications of the 
stems while who most 
evasively. Using this technique, two judges independ 
ently scored 13 protocols from 
with 89% 


the weights assigned to each item and 


avoiders are those respond 
a pretest population 
scoring agreement 


TAT Measure 


To increase objectivity, the usual method of ad 
ministering and scoring the TAT was modified 
cards (4, 6BM, 7BM, 13MF, 14, 17BM, 
were presented, each accompanied by six 


Seven 
and 20) 
themes 
four neutral and two embodying threat, accident, or 
trauma. For example, with 6BM: 


neutral 
to apply 


This young man has come to the mansion 
for the job of gardener. He has been 
asked to wait by this elderly maid. So, hat in hand, 
he waits 
threat—The son has just had to tell his 
that all of their savings have been wiped out 
stocks are worthless that they invested in 
are both stunned by this development 
left 


mother 

The 
They 
Nothing is 


Each subject rank ordered the alternative themes 
in terms of how well they fitted the card. The sub- 
ject’s score is the sum of the ranks of the 14 threat 
themes. The higher the score, the lower the prefer 
ence ranking for threat themes 

Each subject was tested individually. The modified 
TAT was administered, followed by the coping-avoid- 
ing ISB. Subjects responded anonymously to both 


TAT and ISB 


RESULTS AND DISCUSSION 


From the foregoing procedure it was pos- 
sible to match high and low anxiety subjects 
with respect to ISB scores. The N of each 
group was 19 and the mean ISB score for each 
group was 35.0. 

The mean TAT score for the high anxious 
group was 47.2 (SD 4.7), and for the 
anxious group 53.6 (SD 6.5). Applying 
test * the difference between the means is 
nificant and in the predicted direction (¢ 
p = .001). 

The results bear out the hypothesis that 
anxious people see more threat in TAT cards 
than do nonanxious people when their tech- 
nique for defending against anxiety is con- 
trolled. Goodstein (1954) reported a non- 
significant trend in a similar direction due 
perhaps to a lack of control for the coping- 
avoiding dimension. Although a portion of his 


low 


2A one-tailed test of the 
used 


distribution of t was 


E. Jerry Phares 


data was based on 10 anxious and 10 non- 
anxious subjects while the present data is 
based on 19 pairs of subjects, the discrepancy 
between probability estimates in the two stud- 
ies is much greater than would be expected 
merely by augmenting A 

Thus, it seems probable that in a given un- 
selected population of anxious subjects, some 
who are copers and others avoiders, the con- 
firmation of an otherwise perfectly logical and 
tenable hypothesis might be prevented. For 
example, Lesser (1959) demonstrated that 
under conditions of high anxiety about ag- 
gression, the intercorrelations among various 
measures of aggression were significantly lower 
than in the case of low anxiety over aggres- 
sion. A behavior occurs not simply as a func- 
tion of one variable but as a function of the 
relationship among several variables. 

Generalization of the present results is, of 
course, limited by the sex composition of the 
population and by the modified TAT 
cedure. 


pro- 


SUMMARY 


This study hypothesized that anxious sub- 
jects ‘will show greater preference for TAT 
themes involving accident, threat, or trauma 
than will nonanxious subjects when matched 
for tendency to evade or cope with threaten- 
ing stimuli. 

Twenty-five subjects high on the Taylor 
anxiety scale and 25 low on that scale were 
administered a modified TAT and a special 
ISB which measures coping-avoiding tenden- 
cies. Subjects rank ordered neutral and threat- 
ening themes which accompanied seven TAT 
cards in terms of how well they fitted the 
cards. 

With this procedure, 19 pairs of high and 
low anxiety female subjects matched for cop- 
ing scores confirmed the hypothesis at a sta- 
tistically significant level. 
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This study was undertaken in order to ex- 
plore possible relationships between two dif- 
ferent general approaches to the description 
and measurement of verbal interview behav- 
ior. One widely applied frame of reference di- 
rects attention to the communication aspects 
of verbal behavior, that is, to some symbolic 
dimension of the content of the words spoken, 
using content analysis to define and quantify 
its variables. A second frame of reference 
focuses upon quantitative temporal charac- 
teristics of interview interactions, utilizing 
measures such as number and duration of ut- 
terances, duration of silences, etc. 

The few studies which have incorporated 
measures of both content and temporally de- 
fined variables have usually indicated the 
greater discriminatory power of the latter. 
For example, Page (1953), Lennard (1955), 
Goldman-Eisler (1952), and others have 
found that quantity or tempo of verbal out- 
put is both more stable and more highly cor- 
related with various criteria of personal ad- 
justment or psychotherapeutic success than 
are content-derived variables. 

Lundy (1950) found that a single patient 
seen concurrently by two therapists differing 
in therapeutic technique showed no difference 


1 This investigation was supported by a research 
grant (M-735) from the National Institute of Men- 
tal Health, of the National Institutes of Health, 
United States Public Health Service. This paper is 
based in part on a dissertation, under the direction 
of Frederick H. Kanfer, submitted by the senior au- 
thor in partial fulfillment of the requirements for the 
degree of doctor of philosophy at Washington Uni- 
versity. The data were collected while all of the in- 
vestigators were at Washington University. 


in his responses to the two therapists in meas- 
ures of content (Distress-Relief Quotient, 
Raimy’s self-references, etc.). However, dif- 
ferences in tempo of interaction in the two 
sets of therapeutic interviews were apparent 
in the protocols. As a secondary aspect of a 
subsequent study, Lundy (1955) compared 
the temporal variables with clinical estimates 
of occurrence of 
topical content in 


emotional and 
interviews. The re- 
sults indicated that the contentual and tem- 
poral variables were related. 


significant 


“key 9? 


The present study involves the direct in- 
vestigation of relationships between more pre- 
cisely measured content variables and the 
temporal measures of the Interaction Chrono- 
graph. This instrument and Chapple’s un- 
derlying interaction theory (Chapple, 1949; 
Chapple & Arensberg, 1940) constitute an ex- 
tensive, systematically developed attempt to 
describe temporal phenomena in verbal be- 
havior and to investigate their significance. 
Definitions of the Interaction Chronograph 
variables involved in the present study are 
given in Table 1. Without more 
knowledge on part of the 
“meaning” of 


extensive 
subtle 
the several inter- 
correlated Interaction Chronograph variables 
(Matarazzo, Saslow, & Hare, 1958), precise 
hypotheses as to specifically which content 
and interactional variables would be related 


our dif- 


ferences in 


could not be formulated. However, tentative 
hypotheses regarding the general types of re- 
lationships to be expected on a “face validity” 
basis were obviously the determinants of the 
kinds of content categories selected for use, 
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Descriptive Content and Interaction Behavior 


TABLE 1 


INTERACTION CHRONOGRAPH VARIABLES 


Pt.’s Units The number of times the patient 


acted 


Pt.’s Tempo The average duration of each action 


plus its following inaction (silence), 
as a single measure 
Pt.’s Silence The average duration of the pa 


tient’s silences 


The durations of the 
terruptions minus 


Pt.’s Adjustment patient’s in 
the durations of 
his latencies in divided 


by Pt 


responding, 


’s Units 


Pt.’s Initiative 


1es, out of the 
number of opportunities 
(usually 12) in Period 2, in which 
the patient acted again 


The percentage ot ti 


available 


within a 
15-second limit) following his own 


last action 
The average length of time in Period 
2 that the 
taking the initiative following his 


Pt.’s Quickness 


patient waited before 


own last action 


Period 4 


“talked down” the 


The number of times in 
that the patient 


interviewer 


Pt.’s Dominance 


minus the number of 
times the interviewer talked down 
the patient, divided by the number 
of Pts.’s Units in that period 


and are illustrated below in the description of 
the content system. 


PROCEDURE 
Forty patients randomly selected from new re- 
ferrals to the Psychiatric Outpatient Clinic of a large 
urban medical center were interviewed by the same 
psychiatrist according to the published rules of the 
partially standardized interview which is used with 
the Interaction Chronograph in order that the inter- 
viewer may serve as a partially controlled or in- 
dependent variable during each of five predefined 
subperiods of the interview (Matarazzo, Saslow, & 
Matarazzo, 1956; Saslow & Matarazzo, 1959). Each 
interview was observed from an adjoining room 
through a one-way mirror by the same observer, 
who recorded the ongoing interaction on an Inter- 
action Chronograph and made a verbatim sound re- 
cording on a Gray Audograph. Ten of the 40 pa- 
tients were omitted from the final experimental sam- 
ple because of deficiencies in the sound recordings, 
or, in a few cases, because of the presence of acute 
psychosis which resulted in confused, incoherent in- 
terview content. The 30 remaining patients served as 
the subjects of the present study. Of these, 17 were 
female, 13 were male. Their ages ranged from 20 to 
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with a median age of 35.5 
quent diagnoses were 
neurosis 


The most fre- 
hysteria (eight cases), anxiety 
(seven cases), depression (six 
schizoid personality (four cases) 

The sound recordings were carefully and repeatedly 
monitored by the typist and a judge, until a high de- 
gree of recording transcription fidelity was achieved 
Since the reliability of the content scoring process 
had been demonstrated on an independent sample of 
transcripts shortly before (Phillips, 1957), the final 
transcripts were unitized and then categorized, unit 
by unit, by one experimenter 

The content aspects of the verbal interview beha\ 
ior were defined and quantified according to the cate 
gory system schematically diagrammed in Table 2 
The system and its development have been described 
in Phillips (1957). It represents an adaptation and 
extension of the Interpersonal System devised by 
Freedman, Leary, Ossorio, and Coffey (1951), C and 
C’ in Table 2, which consists of a circular continuum 
of 16 categories, representing qualitative blendings of 
two orthogonal dimensions, love-hate and domi 
nance-submission. A seventeenth category was added 
for coding of units unscorable or neutral within the 
Interpersonal System 


61 years, 


cases) and 


In addition to the Interpersonal System, several 
other dimensions were coded in order to achieve com 
pleteness of coverage and more general applicability 
These other dimensions were operationally defined 
by: (a) coding of units without interpersonal refer- 
ence, D and D’ in Table 2, such as “I sat down and 
ate,” “My head aches,” etc.; (b) coding of the actor 
or subject of a description and of the receiver of ac- 
tion or attitude, if any, A and E in Table 2; and (c) 
coding of the general topic discussed, e.g., marriage, 
finances, symptoms, etc. Provision was also made for 
differentiation of descriptions of motor acts, C and D, 
from description of nonmotor states of being, think- 
ing, feeling, etc. C’ and D’. For example, “I yelled 
at her” is an interpersonal act, C, while “I was angry 
at her” is coded as an interpersonal state, C’; “I ate 
dinner” is a noninterpersonal act, D, while “I felt 
sick” is a noninterpersonal state, D’. Those units 
which referred directly to the ongoing interview in- 
teraction and had no referent outside of the cur- 
rent situation (eg., “Thank you, doctor,” “I agree 
with you”) were classified separately, according to 
their function, within categories adapted from Bales 
(1950), II in Table 2. A residual category was uti 
lized for unscorable units, III 

The resulting system is of a pyramidal nature, in 
volving several series of mutually exclusive cate 
gories. It was considered particularly suited to the 
purposes of this study because of its interpersonal 
interactional emphasis and its use of content vari- 
ables seemingly parallel to the temporal variables of 
the Interaction Chronograph. For example, it was 
expected that one or more of the Interaction Chrono 
graph measures of verbal “output” (number and 
duration of utterances, etc.) would be related to the 
frequency of content describing the self as physically 
active, Self C+D in Table 2. Since the former 
measures of verbal output in the interview have been 
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rABLE 3 


CORRELATIONS BETWEEN CONTENT CATEGORIES 
Pt Pt.’s 
Content Units Tempo 
Description I 
Self Subject (A 
Self Acts (D + ( 38 
Self Interpersonal (C + (¢ 34 
Self Dominant-Hostile Qua 38 
Self Submissive-Hostile Qua 
Self Dominant-Positive Qua 
Self Submissive 
Self Positive 
Self Noninterpersonal (D D 37 
™ lf Nor interpe rsonal 
Negative (D + D’ 3 
Other Interpersonal (C + ( 
Other Noninterpersona!l (D + D 
All Interpersonal (C + ( 1) 
Direct (II 34 31 
Number Topics (F 3] 
Number Persons A+ I 
ve Os ¢ 0 } 


rentheses), are included. Table 3 also does not 


include content categories which are essen- 
tially reciprocals of those presented, nor some 
Interaction Chronograph variables which have 
been shown to be highly correlated with those 
included and whose correlations with content 
categories essentially duplicated these results 
Pt.’s Units. Because of the relatively fixed 
length of the interview and of the interview- 
er’s utterances, Pt.’s Units is highly corre- 
lated in a negative direction with measures 
(Pt.’s Action, Pt.’s Tempo, and Pt.’s Activ- 
ity) of the duration of the patient’s utter- 
ances. In the present sample it correlates —.70 
with Pt.’s Action and may be assumed to be 
one of the more stable and representative 
measures of the general level of the patient’s 
verbal output because, as a frequency meas- 
ure, it is less affected by a few extremely short 
or long utterances than are the duration meas- 


The complete matrices for content vs. Interaction 
Chronograph correlations have been deposited with 
ADI. Order Document No. 6641 from ADI Auxiliary 
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Make checks payable to: Chief, Photodupli 


cation Service Congress 


brary of Congress; 
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, Library of 
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Interaction Chronograph 


Pt.’s Pt.’s Pt.’s Pt.’s Pt 
nce Adj Init 


$2 34 1) 
48 x6 


1) 


ures.* The results shown in the first column of 
lable 3 reveal that the patient who has fewer 
Pt.’s Units, that is, who speaks relatively in- 
frequently (but with longer durations per ut 


terance), is shown by these correlations with 


content measures to describe himself as rela 
tively: more active in his daily living, more 
oriented toward interactions with other peo- 


ple, less concerned with his own solitary ex- 
periences, interested in a wider variety of 
events in daily living, and less prone to evade 
description of himself in general. 

Pt.’s Tempo. This is highly related to Pt.’s 
Units (7 .84 in this sample) since the 
less frequently a patient speaks in the stand- 
ardized interview, the longer of necessity will 
be his Tempo or “cycle” 
the 


of speaking. Hence 


relationships shown for Pt.’s Tempo 
should be considered in close conjunction 
with those for Pt.’s Units. As indicated in 
*The fact that these highly related measures of 
verbal activity level do, however, at times differ 
somewhat in their relationships With external behav 
ior points out the necessity for retaining all of the 
Interaction Chronograph variables despite some high 


intercorrelations, until the areas of their overlap and 
independence have been more completely defined and 
explored 
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Table 3, the longer the average duration of a 
patient’s Tempo, the more he describes him- 
self as dominant-hostile in his dealings with 
and attitudes toward other persons. 

Pt.’s Silence. This is negatively related to 
the percentage of content units which are de- 
scriptive in nature and is positively related to 
the relative frequency of direct interaction 
with the interviewer. One might hypothesize 
that both silence behavior and remarks made 
to the interviewer represent “resistance” or 
avoidance tactics, since the questions asked of 
the patients are nondirective ones calling for 
description of their general life patterns and 
do not deal with events within the interview. 
This hypothesis is also supported by the tend- 
ency noted above for the number of (short) 
utterances to be inversely related to such di- 
rect discussion with the interviewer, as well as 
by the trend for longer silences to be accom- 
panied by the introduction of fewer different 
topics. 

Pt.’s Adjustment. Since all patients hesi- 
tated before responding for longer durations 
than they interrupted, in analyzing the data, 
the minus sign was omitted, so that high 
scores for Pt.’s Adjustment indicate greater 
relative latency of response. The results in 
Table 3 show that patients with high Pt.’s 
Adjustment scores can be considered to be 
almost opposite to those with fewer (but 
longer) Pt.’s Units. Thus, “maladjustment” is 
negatively related to the relative frequency of 
description of the self as active and to the 
number of different other persons mentioned, 
while it is positively related to the relative 
amount of talk about the self. Further, the 
“slow responders” relatively less frequently 
describe the acts or the attitudes, both inter- 
personal and noninterpersonal, of other peo- 
ple: when they do describe their own in- 
terpersonal attitudes or dealings with other 
people, they relatively more frequently char- 
acterize themselves as taking a submissive 
role, probably one which is also hostile. They 
also mention fewer negative noninterpersonal 
things about themselves while tending to focus 
more upon the noninterpersonal aspects of 
their lives in general. Similar to Pt.’s Silence, 
this latency-of-response measure tends to be 
negatively associated with amount of descrip- 
tive content and positively related to relative 
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frequency of direct interaction with the inter- 
viewer, suppérting the hypothesis that this 
distinction between description and interac- 
tion with the interviewer represents avoidance 
in the face of difficulty in communicating. 

The similar nature of the content category 
correlates of Pt.’s Units and Pt.’s Adjustment 
suggests what might be termed an “outward” 
or “other-directed” orientation in those pa- 
tients who speak a fewer number of times and 
also in those who speak with a shorter latency 
of response. Patients who speak more times 
and those who wait longer before speaking 
have content which focuses more upon them- 
selves, and thus might be termed “inward”’ 
or “self-directed.” This interpretation is sup- 
ported by our recent finding that schizo- 
phrenic patients, who might be thought of as 
being at the extreme pole on a continuum of 
inward-directed vs. outward-directed orienta- 
tion, speak a significantly greater number of 
times (but in shorter average utterances), and 
with much longer average latencies before re- 
sponding, compared to normals (Matarazzo & 
Saslow, 1961). 

Pt.’s Initiative. The more the patient shows 
temporal interactional initiative (speaks again 
following his own last utterance), the more he 
also shows a form of “initiative” in raising 
new topics (and the broader one might there- 
fore infer his interests or concerns to be). A 
similar but nonsignificant trend is shown for 
the total number of different persons men- 
tioned. 

Pt.’s Quickness. These results are supported 
by the finding that another temporal variable 
has similar correlations with the number of 
topics and number of persons mentioned. Un- 
like Pt.’s Initiative, however, Pt.’s Quickness 
is similar to Pt.’s Silence in being related sig- 
nificantly to the relative amount of descrip- 
tive content vs. interaction with the inter- 
viewer. Thus Quickness seems to have two 
components: one, the readiness to com- 
municate in patients who take the initiative 
rapidly, a covariation which is similar to that 
found for Silence; and secondly, a component 
which, like the relationship found for Initia- 
tive, is related to the broader range of con- 
cerns (more people and more topics) of these 
same patients (with the shorter Quickness 
durations). 
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Pt.’s Dominance. This is directly related to 
the relative frequency with which the patient 
talks about people other than himself as well 
as to the relative concern he shows for the 
noninterpersonal feelings and behavior of 
others. Particularly striking is the finding 
that the more dominant the patient is in his 
temporal interview behavior, the more he de- 
scribes himself in content as dominant-posi- 
tive (e.g., teaching, helping, advising, protect- 
ing, etc.) in his attitudes and dealings with 
other people. Thus it seems that for the pa- 
tient whose interpersonal style is described (in 
content) as more dominant, the interruption 
behavior on the part of the interviewer in 
Period 4 is animating and challenging, rather 
than defeating. The findings for Pt.’s Domi- 
nance are very similar to those for Pt.’s Units, 
the content correlates of which also empha- 
sized relatively more discussion of others. 
However, patients with higher verbal output, 
as defined by Pt.’s Tempo, more frequently de- 
scribed themselves as dominant-Aostile rather 
than dominant-positive in their own interper- 
sonal roles. 


DIsCUSSION AND SUMMARY 

The results seem both encouraging in an 
exploratory study relating two quite disparate 
phenomena of interview behavior, and sug- 
gestive in the new meaningfulness which they 
add to the Interaction Chronograph variables 
by indicating relationships which seem inher- 
ently plausible and internally consistent. They 
provide a foundation for an approach to per- 
sonality which combines content and temporal 
variables, and suggest personality dimensions 
which, underlying as they do at least two 
quite different spheres of behavior, may be 
particularly pervasive and consistent. 

Viewed as a whole, the data presented in 
Table 3 suggest that patients who speak less 
often. who are faster to respond, and more 
dominant in the interview (that is, who have 
fewer and hence longer Units, shorter Pt.’s 
Adjustments, and more Dominance in the in- 
terruption stress period) have interview con- 


* There is no relationship in this sample between 
Pt.’s Units and 


Dominance (r= 27, not signifi- 
cant), while Dominance and Pt.’s Adjustment are 
negatively correlated (r = 45). Pt.’s Adjustment is 


positively correlated with Pt.’s 


(r 45) 


Units, however 
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tent which is relatively more oriented towards 
other people and towards interpersonal inter- 
action, with social roles which are relatively 
more frequently described by them as domi- 
nant, either in a paternalistic or a hostile 
fashion. On the other hand, the correlations 
imply that the more a patient loses or sub- 
mits to interruptions, is hesitant in speaking, 
and is less active verbally, the more his con- 
tent emphasizes his own noninterpersonal con- 
cerns rather than interaction with others, and 
the more submissively hostile is his self-de- 
scribed role with other people. 

In conjunction with the results of other va- 
lidity-oriented studies of interview behavior 
correlates (Hare, Waxler, Saslow, & Mata- 
razzo, 1960; Matarazzo, Matarazzo, Saslow, 
& Phillips, 1958; Matarazzo & Saslow, 1961), 
the present findings are a beginning at de- 
scribing the characteristics which suggest a 
significant and cohesive description of the pa- 
tient and how he interacts with others, viewed 
both from the subjective (content-inferred) 
and objective (temporally measured behav- 
ior) levels of observation. The major impli- 
cation of these results bears upon the degree 
of generalizability of the Interaction Chrono- 
graph constructs and hence has to do with 
their concurrent (and, more remotely, con- 
struct) validity. 

The correlations shown in Table 3 are small 
in the sense of accounting for relatively little 
of the variance, although respectable for com- 
plex personality variables. Further, the com- 
plete correlation matrix contained 336 r’s (28 
content categories vs. 12 Interaction Chrono- 
graph variables), of which 28 were significant 
at the .05 level or better. Since 17 would have 
been expected to be significant at this level 
by chance alone, a replication study is being 
undertaken to determine whether the present 
findings can be cross-validated with another 
sample of subjects. A number of hypotheses 
are suggested by the results of this validity- 
oriented study which can be pursued in fu- 
ture investigations if the present findings are 
borne out in the replication study. 
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THE CONCEPT OF NORMALITY: 
A REPLY TO FREIDES 


MAURICE 


Univer uiv of Texa 


In an interesting paper published in this 
journal, Freides (1960) presses for the elimi- 
nation of the concept of normality on the 
grounds that there is little agreement concern- 
ing its definition, that it ascribes absolute, 
yet culture bound, patterns of behavior and 
that it disregards the flexible interactions be- 
tween personality and circumstances. Al- 
though Freides concerns himself nearly ex- 
clusively with the idealist-adjustment view of 
normality, other approaches such as the sta- 
tistical-average conception are likewise dis- 
missed with the conclusion that “for purposes 
of scientific theory and also for practical clini- 
cal purposes” our emphasis must shift away 
from considerations of normality (or patho- 
genicity) and toward a greater concern with 
“the potentialities of every person under the 
proper conditions.”’ Such a position, it will be 
argued here, springs from too narrow a con- 
ceptualization of both scientific theory and 
clinical purposes. It has, moreover, some un- 
fortunate implications for future research de- 
velopments and basic orientations in clinical 
psychology. 

Although normality is (today) an admit- 
tedly low powered concept, judgments of nor- 
mality-pathogenicity hover in the background 
of many diagnostic and therapeutic decisions. 
It seems that, ambiguous and ill- 
defined a conception of normality our pro- 
fessional behavior reflects, this term nonethe- 
less behaves the way a good construct should 
(Beck, 1953): as a host 
of personality characteristics and behavioral 
tendencies, as a gross predictor of a person’s 
future behavior and as an object of search. 
This last role will be made clearer as this dis- 
cussion develops. 


however 


summarizer of a 


If our clinical practice reflects (as I believe 


Southwestern 
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it does) the use of some concept of normality, 
it becomes appropriate to inquire into what 
kind of meaning can be ascribed to it. The 
controversy over the idealist-adjustive vs. the 
statistical-average interpretations of normality 
has obscured some more basic considerations. 
Why should the question, “What is normal- 
ity?” have a different logical status than, say, 
the question, “What is schizophrenia?” That 
a diagnosis of schizophrenia, for instance, im- 
plies a hypothesization of an inner structure 
or state which is but inadequately described 
by an operational definition, and that such 
taxonomic conceptualization is a scientifically 
valuable activity has been convincingly argued 
by Meehl (1959). Although our present day 
conception of schizophrenia is riddled through 
with indeterminacies, we would be hard put 
to do without it—clinically or scientifically. 
The concept of normality suffers from the 
same type of inadequacies, only more so, since 
it belongs to a far less imposing nomological 
network than does schizophrenia. 

In the case of both schizophrenia and nor- 
mality we are asking legitimate questions to 
which, unfortunately, no sufficiently complete 
or specific answer can be given at this time. 
But scientific theory as it is understood today 
(Hempel, 1952) accepts vagueness or open 
endedness in the real definitions (i.e., defi- 
nitions involving a statement of the essen- 
tial nature or characteristics) of concepts. We 
should understand the question, ““What is nor- 
mality?” not as a demand for an unequivo- 
cal definition (as Freides implicitly does) but 
as a request for an empirically based specifi- 
cation of the indicators (prevalence of con- 
trol factors? level of corticoseptal integration? 
degree of reasonableness? appropriateness of 
autonomic arousal? etc.) which may be found 
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to reflect with varying probability the exist- 
ence of this hypothetical state. During the 
early stages of a science, the specification of 
the meaning of concepts is, as Kaplan (1946) 
notes, “a provisional one, both as to the indi- 
cators included aad the weights associated 
with them.” 

The definition of normality, the hunt for 
the indicators of its essential characteristics, is 
thus construed as a processive affair marked 
by high responsivity to new data. An illustra- 
tion incorporating some suggestive data might 
be of interest. To begin with, let us consider 
a first temporary indicator of normality such 
as nonexposure to psychiatric diagnosis and 
treatment. As an operational definition of 
normality, this criterion would obviously be 
grossly inadequate as would many others. Our 
gambit rests on the hope that it may eventu- 
ally enable us to lift ourselves by our boot- 
straps (Cronbach & Meehl, 1955) by calling 
attention to other “purer” criteria which cor- 
relate (imperfectly) with it and which may 
in turn possess greater validity than our origi- 
nal indicator does. Many of our more success- 
ful constructs in psychology (e.g., Binet IQ) 
have evolved in this way. Using such a first 
indicator we would orient ourselves to dis- 
covering what other indicators (and charac- 
teristics by implication) compose a matrix of 
significant relatedness. It is clear at this point 
that this approach, in contrast to the idealist 
or average views of normality, would lead us 
to an empirically based conceptualization of 
normality having the essential character of a 
theoretical construct. We might pause to ask, 
how much reliable data do we happen to have 
concerning these “nonexposed” (or otherwise 
discriminated) normals? Surprisingly little. 
From reading the psychological literature a 
Martian might be led to believe that the pro- 
portion of schizophrenics in the United States 
is 80% instead of .8% or that the nonexposed 
represent only some 10% of the population 
when they should number closer to 90%. 

A perusal of the literature dealing with such 
loosely defined normals leaves one major im- 
pression: there is an unexpectedly high dis- 
tribution of “pathogenic” traits, histories, be- 
haviors, in this population while by contrast 
many of the allegedly normal (ideal-adjusted ) 
characteristics we have been taught to expect 
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are by no means typical of it (Lapousse & 
Monk, 1958; Renaud & Estess, 1955; Scho- 
field & Balian, 1959). For our purposes this 
would suggest that quite a few of the admi- 
rable, clearly nonpathogenic traits and behav- 
iors with which we have traditionally invested 
normality relate poorly to this concept as de- 
fined (preliminarily) by the nonexposure and 
other similar indicators. Parenthetically, this 
represents an attack on the same idealist- 
adjusted definition of normality that Freides 
finds unsatisfactory—without, however, elimi- 
nating normality as an object of scientific 
search. On the contrary, research on normal- 
ity should be quickened and stimulated by 
such findings, presuming they stand up under 
more systematic investigation. Are there, for 
instance, suppressor-control variables which 
override the effect of such pathogenic events 
(Schofield & Balian, 1959), and are they the 
indicators of normality? Or will the answer 
involve a favorable interaction between cer- 
tain constitutional factors and patterns of 
personal history? We certainly do not know 
the answer at this time. There can be little 
doubt, however, that the concept of normal- 
ity constitutes an object of search that is not 
only scientifically appropriate but also of 
great moment to psychology. Such research 
emphasis may well be a factor in helping to 
precipitate a much needed shift away from 
the basic orientation to pathology that char- 
acterizes many of our efforts today. Clinical 
psychology needs to reacquaint itself with its 
most unique and natural subject matter— 
normal man. It is unfortunate, as Sanford 
(1958) has pointed out, that “we .. . talk 
much more freely about symptoms of illness 
than about symptoms of health or symptoms 
of resiliency or of strength or of spontaneity 
or of creativity” (pp. 82-83). 

Little has been said in this rejoinder con- 
cerning the aspect of the problem of normal- 
ity which bears Freides’ strongest criticisms, 
the evaluative component of normality. Our 
approach simply sidesteps this argument alto- 
gether. Normality as here conceived involves 
absolute values or cultural biases no more and 
no less than does, e.g., our concept of schizo- 
phrenia. Certainly normality is culture bound, 
but only to the extent that all concepts are 
“bound” to the phenomena they describe or 











Reply to Freides 


explain. Likewise, what kind of value judg- 
ment would be involved in the empirical de- 
termination that, e.g., a specific suppressor- 
control variable constitutes an essential char- 
acteristic of normality? As far as we can see 
the only values involved in this enterprise are 
the same that motivate all scientific research 
and as such they are rightly accepted as neu- 
tral with reference to the content of science. 

Although the bulk of this paper has been 
devoted to a defense of a conceptualization 
of normality, we might briefly consider the 
alternative that Freides champions, i.e., that 
we concern ourselves instead with the specific 
abilities and limitations that individuals dem- 
onstrate under specific circumstances. This 
point of view has a long history in psychology 
as the specificity theory of personality, and 
as such has been much discussed over the 
years. Some general comments, however, seem 
appropriate here. First, in addition to seeking 
the highest possible accuracy in the explana 
tion and prediction of behavior, science also 
attempts to be maximally comprehensive and 
parsimonious. Freides’ approach represents an 
admission of failure with respect to these de- 
siderata. Secondly, the two approaches are by 
no means incompatible. Classifying a person 
as “normal” does not by any stretch of the 
imagination preclude an appreciation of his 
limitations under certain circumstances. It 
can easily be conceived that broadly specified 
situational variables may play a significant 
role in behavior prediction equations. To what 
extent this is true we do not know. It is fur- 
ther possible, once this typological approach 
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has begun to yield some fruits, that we will 
be able to shift to systematic studies of the 
processes which favor normal psychological 
growth. It is clear, however, that psychology 
would be poorly served if we gave up our 
search for the meaning of normality—a po- 
tentially powerful explanatory and predictive 
construct. 
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THE EFFECTS OF TWO VERBAL TECHNIQUES 
ON THE EXPRESSION OF FEELINGS 


GUSTAV LEVINE ? 


Teachers College, Columbia University 


A problem frequently encountered in psy- 
chotherapy is the client’s use of impersonal, 
nonemotional statements. The therapist’s re- 
quests for statements referring to the client’s 
feelings are frequently met with descriptions 
of situations or impersonal observations, even 
though the client is highly motivated to co- 
operate. Therapists therefore frequently at- 
tempt to behave in a manner that facilitates 
their client’s emotional expression. 

Rogers, in his early writings, stated his 
observation that reflection of feelings results 
in the immediate expression of further feel- 
ings by the client (Rogers, 1942, p. 158). 
Studies of recorded therapy sessions which 
utilized categories of clients’ expressions of 
feelings and therapists’ reflections of feelings, 
have not examined the specific sequence of 
reflection and expression of feelings (Berg- 
man, 1951; Seeman, 1949; Snyder, 1945). 
Verplanck (1955), in an experiment involv- 
ing a social rather than a therapeutic situa- 
tion, found that paraphrasing, a technique 
which is descriptively similar to reflection, 
increased the class of statements paraphrased. 
Although feelings were not paraphrased, but 
rather statements of opinion, the results are 
encouraging to a hypothesis that reflection of 
feelings increases expression of feelings. In 
the Verplanck study, paraphrasing of the class 
of responses to be increased was the only re- 
sponse given by the interviewers. This re- 
stricted attention to one class of response may 


1 This paper is taken from portions of a thesis sub- 
mitted to the Department of Clinical Psychology, 
Teachers College, Columbia University, in partial 
fulfillment of the requirements for the PhD degree. 
The author wishes to express his appreciation to his 
Chairman, Joel Davitz, for his aid and encourage- 
ment. 
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have been the factor which reinforced this 
response (rather than the specific paraphras- 
ing technique), implying that any technique 
involving restricted attention to one class of 
response would reinforce that class. On the 
other hand, the specific technique (paraphras- 
ing) may be reinforcing in the same way that 
approval can be reinforcing (Murray, 1956). 
The contention that restricted attention is the 
significant factor is supported by the work of 
Salzinger and Pisoni (1958, 1960), who found 
that simple statements such as, “I see,” 
“Yeah,” “Uhha,”’ “Mmm-hmm,” etc., could 
act as positive reinforcers of expression of 
feelings. 

The present study involves a comparison of 
the specific technique of reflection of feelings 
with a simple undifferentiated vocalization, 
““Mm-hm,” beth applied as the only response 
given, in separate interviews, and occurring 
only after expressions of feeling. In such a 
pair of interviews both interviewer techniques 
would constitute restricted attention to a class 
of response, and any special advantage of the 
more complex response could express itself in 
more effective reinforcement. 

It was hypothesized that reflection of feel- 
ings results in greater expression of feelings 
than does an_ undifferentiated 
(“Mm-hm’’). 

The subjects were 30 male undergraduate 
students, living at college, who volunteered 
and were paid for their participation. 


vocalization 


The experiment was explained as a study of 
feelings about school life. The experimenter 
stated that he was “interested in what you are 
experiencing” as a student. The subjects each 
received two similarly structured interviews, 
which differed only in the way in which the 
experimenter responded to expression of feel- 
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ings by the subjects (reflection in one, “Mm- 
hm” in the, other). 

The interviews were scheduled one week 
apart and tape recorded. One half of the sub- 
jects received one condition first, the other 
half receiving the other condition first. 

The interviews were rerecorded with the 
interviewer's responses omitted, and with the 
tape divided into one-minute segments of talk 
by the subjects. The presence or absence of 
expression of feelings in a single minute was 
determined by six psychologists. These judges 
used a detailed set of instructions given to 
them by the experimenter as a frame of refer- 
ence for their judgments. 

The basic data for the testing of the hy- 
pothesis was the number of minutes contain- 
ing expressions of feeling. 

The one-minute rated for 
presence or absence of feeling by both the 
experimenter and a judge. The agreement was 
high, a phi coefficient of .92 having been ob- 
tained through a transformation of the com- 
puted chi square value of 428.9. 

The average number of minutes of feeling 
in each of the two interview conditions was 
computed. In the reflection interviews there 
was an average of 10.07 minutes containing 
expression of feelings, and in the “Mm-hm” 
interviews there was an average of 9.83 min- 
At 
test of the difference yielded a nonsignificant 
t of .23. 

The results indicate that under the condi- 
tions of the present experiment there is no 
difference in effectiveness reflection 


segments were 


utes containing expression of feelings. 


between 


The instructions are in the appendix to the thesis 


The thesis can be obtained on microfilm from Uni- 
versity Microfilms: 313 North First Street; Ann Ar 
bor, Michigan. Order L.C. Card No. Mic. 58 37 


remitting $2.00 


of feelings and the undifferentiated vocaliza- 
tion ‘““Mm-hm,” as techniques for increasing 
expression of feelings. The additional factor 
of clarification of feelings through re-expres- 
sion in different words does not seem to in- 
crease the frequency of expression of feelings 
beyond that obtained with any 
which responds only to feelings. 

The experiments of Verplanck (1955) and 
Greenspoon (1955), indicate that the two 
techniques can each be effective compared to 
no technique (operant level), but there was 
no control in the present experiment with 
which to make such a comparison. 


technique 
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BRIEF REPORTS 


ANXIETY 


IN VERBAL BEHAVIOR: 


AN INTERCORRELATIONAL STUDY! 


MERTON S. KRAUSE 


University of Michigan 


Several objective measures of anxiety in verbal 
behavior have been proposed in the last 3 dec- 
ades. Their claims to validity are relatively weak, 
generally resting upon their apparent reasonable- 
ness. If they appear to reflect the same “inner 
state” and so yield highly correlated measure- 
ments on the same behavior samples, their claims 
would be stronger. They might be said to show 
concurrent validity 

Ten-minute recorded samples from the therapy 
sessions of 15 hospitalized male mental patients 
were studied. Eight purported measures of anx- 
iety were applied to each patient’s verbal response 
in the 15 protocols. The measures were (a) num- 
ber of words spoken, (b) number of words/num- 
ber of inspirations, (c) number of verbs/number 
of adjectives, (d) latency of the response, (e) 
number of references to the interviewer, (f) and 
(g) number of speech disruptions as measured 
by Mahl (1956) “non-ah” ratio and Dibner 
(1956) Cue Count I and (/) rate of speech. 

An intercorrelation matrix was computed for 
each of the 15 protocols to study the amount of 
individual differences, and then the matrix of 
median intercorrelations was derived. The aver- 
age level of correlation in this later matrix was 


1 This study was supported under Grant M-516 
C-7 from the National Institute of Mental Health, 
E. S. Bordin, Principal Investigator. 
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without charge from Merton S. Krause (2343 Auburn 
Avenue; Cincinnati 19, Ohio) or for a fee from the 
American Documentation Institute. Order Document 
No. 6642, from ADI Auxiliary Publications Project, 
Photoduplication Service, Library of Congress; Wash- 
ington 25, D. C., remitting in advance $1.25 for 
microfilm or $1.25 for photocopies. Make checks 
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06, and so no general 
measures is evident 


convergence of the several 
If these measures do possess 
some degree of concurrent validity it was not 
uniform enough over persons to appear in our 
sample. The cluster pattern in the matrix of me- 
dian intercorrelations does suggest some conver- 
gences in our set of measures. As might be ex- 
pected by inspection of the measures themselves 
the two speech disruption 
correlated (rg; = .91) 


measures were highly 
while verbs/adjectives and 
number of words were moderately loaded on the 
same factor 2 and .24 
This pattern of Measures f, g, c, 
for about half of the protocols 
protocols, however 
grated. 


(about .4 respectively ) 
and a held up 
In at least 


this pattern clearly disinte- 


two 


The average results have a very large sampling 
error over The average 
range for the median values was .45 


persons interquartile 


Thus, there 


are marked individual differences in the level of 


intercorrelation among the various verbal anxiety 
measures. This implies that different measures 
may be valid for different persons and that what 
measurement values indicate anxiety or nonanx- 
iety may also be idiosyncrati¢ 


These results sug- 
gest that verbal measures are not going to be any 
less troublesome to validate than are physiologi- 
cal measures of anxiety 
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VERBAL AND PERCEPTUAL COMPONENTS IN WISC PERFORMANCE 
AND THEIR RELATION TO SOCIAL CLASS ' 


JOHN B 


Veterans Admini 


MARKS 


tration Hospital, American Lake, Washington 


snp JAMES E. KLAHN 


Tacoma Public 


Most investigators have found measured intelli- 
gence positively related to social class and have 
found this relation closer with verbal rather than 
nonverbal materials. With the WISC, however 
Estes (1953) found higher status superiority only 
at a 7-year-old level and not at the 10-year-old 
level. Moreover, she found no consistent pattern 


differences of subtests between her upper social 


group and her lower group. 

The present study relates social class to 
WISC measures of difference: 
(a) Verbal IQ-Performance IQ, and (b) the dif- 
ference between weighted scores of subtests highly 
loaded Cohen’s verbal factors and 
weighted scores of subtests highly loaded on the 
perceptual factor. This latter was the mean of In- 
formation, Comprehension, Similarities, and Vo- 
cabulary minus the mean of Picture Completion 
Block Design, and Object Assembly 

Subjects were 211 primary school children di- 
vided by age and sex into four groups. Both 
younger groups were within 6 months of their 
eighth birthday at testing while the older groups 
were within 6 months of their eleventh birthday 
All had been tested because of some school diffi- 
culty but children with IQs below 70 were elimi- 
nated in sample selection. Mean IQs of the sam- 


ple 


two 


verbal-nonverbal 


on (1959) 


were close to population means 

From information about the father’s occupa- 
tion each subject was assigned to an occupational 
class group ranging from 1, casual laborer, to 9, 
business leader. The r between independent rat- 


An extended report of this study may be ob- 
tained without charge from John B. Marks (Mental 
Health Research Institute; Fort Steilacoom, Wash- 
ington) or for a fee from the American Documenta- 
tion Institute. Order Document No. 6643, from ADI 
Auxiliary Publications Project, Photoduplication Serv- 
ice, Library of Congress; Washington 25, D. C., re- 
mitting in advance $1.25 for microfilm or $1.25 for 
photocopies. Make checks payable to: Chief, Photo- 
duplication Service, Library of Congress 


School 


ings of the two authors was 
two ratings were used. 

Occupational ratings correlated positively with 
IQs in both the younger and the older groups 
Though the correlation in the younger group is 
higher it is not significantly so. On the other 
hand, the girls show a substantially higher cor- 
relation than do the boys. For verbal and full- 
scale IQs this difference is significant, .42 com- 
pared to .19 for verbal and .45 to .17 in the full- 
scale. 


.94 and means of the 


The two measures of verbal-nonverbal differ- 
ence were tested for their relation to occupational 
level, both directly through correlations, and by 
contrasting the difference measures for unskilled 
and semiskilled labor children with those for 
white collar children. The difference between ver- 
bal and performance IQs was in the expected di- 
rection but not significant. The difference in the 
factor-derived measures had a significant r of .16 
(p < .05, N =211) with occupational level and 
the white collar children showed a greater verbal 
superiority (¢ = 2.41, p< .01, 113 df) 

These results are consonant now 
using other instruments. On the 
younger and older groups show a correlation of 
IQ with occupational class, and this correlation 
is higher when verbal materials are used than it is 
for nonverbal. The closer relation between occu- 
pational level and IQ among girls than among 
boys may stem from the higher peer value which 
girls put upon middle class verbality 


with results 
WISC. both 
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The study was designed to test the hypothesis 
that sensory deprivation lowers the threshold for 
projection. To test this hypothesis, a control 
group and an experimental group of 10 subjects 
each were used, half male, half female. The con- 
trol subjects received no deprivation and were 
tested individually for projection using a modi- 
fied autokinetic technique. The technique con- 
sisted of presenting the subject with a dim source 
of light approximately 1 mm. in diameter at 
distance of 9 feet. The subject was told to watch 
the moving pinpoint of light and, when it went 
off, to report what it suggested, or looked like, or 
made him think of. There were 12 1-minute trials 
with a 2-minute rest period after the sixth trial 

The experimental subjects were exposed to sen- 


a 


1 An extended report of this study may be obtained 
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DEPRIVATION AND ITS RELATION 


ROBERTSON 


i) 


TO PROJECTION ! 


snp ROBERT C. MARTIN 


ersity of Florida 


hours and then tested im- 
technique the 
subjects. In the depri- 


sory deprivation for 


iutokinetic in 


mediately with the 
same manner as the control 


vation condition, each subject wearing opaque 
goggles, cotton mittens, and cardboard cuffs. was 


placed on a bed with his head inside a foam rub- 
ber lined box 

The two groups were con pared in terms of the 
number of ll the number of 


responses as Well 
original responses, and 


as 
stimulus-bound responses 
popular responses. Stimulus-bound responses were 
those that referred solely to the movement or di- 
rection of movement of the light. An original re- 
sponse was one that 


and in addit 


vas given only once, by only 
considered 
very novel 
was one that was found 


one 
the investigators to | 
usual. A popular response 
in 


person ion Was 


Vy 


two Ve or un- 


1 
sua 


several records, u occurring more than 
once in a record 

It was expected that the 
would show more projection (greater productiv- 
ity, larger number of original nd 
fewer popular and _ stin 
than the control group. Differences between the 


two groups were not statistically significant 


deprivation group 


responses, a 


ulus-bound responses ) 
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PSYCHIATRIC OUTPATIENT PERSONALITY PATTERNS ' 


MARY HELEN TATOM 
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In a previous study (Tatom, 1958) five patients 


tors in the two analyses suggested that syndromes 
were selected on the basis of medical diagnoses 


were replicated in the second analysis, though 


as representatives of each of four commonly oc- overlap of specific items was not extensive (e.g 
curring nosological entities: obsessive-compulsive, two internally consistent clusters of clinically 
hysteric, anxiety state, and outpatient schizo- schizoid traits had in common only seclusivenes 
phrenic. They were rated by their respective in- and poor social adjustment). Primary factors 


dividual therapists on each of 67 personality vari- were tentatively identified with the respective 
ables, and the resulting scores intercorrelated factors of the original analysis as follows: out- 
The 20 X 20 matrix of person-to-person correla- patient paranoid schizophrenic, 


conversion hy 
tions was factored by Thurstone’s centroid method _ teric, 


socially mature personality, and pa 

to yield four primary and two second-order fac- dependent personality vs. antiso 

tors, tentatively identified with clinical syndromes 
To test the stability of these patterns, 2 refer- 

ence individuals for each of the four primary fac- 


personality 
The latter was less clearly defined by reference 
individuals than were the other three. Patients in 
both analyses were grouped largely on the basis 
tors were inserted into a matrix with 16 new pa-_ of second order factors, rather than highly cor 
tients, selected to represent equally the original related primary factors. Fac 
four clinical diagnostic entities. A factor analysis tended to confirm two second-order factors 
paralleling the first was carried out. The position schizothymia vs 
of reference individuals with respect to primary emotionality vs 


tor and trait patterns 


cyciothymia, and uncontrolled 


overcontrolled emotionality 


factors in the two analyses is given in Table 1 Psychiatric 


and factorial classification of pa 
Trait patterns of structurally corresponding fac- tients did not agree in either analysis; syndromes 


isolated factorially corresponded only roughly to 
An extended report of this study may be ob . : 


. : a the nosological entities represented by patients in 
tained without charge from Mary Helen Tatom (641 ; : ~~ St 


, both groups of tients analvzed 
Houston Ave., Apartment 202; Takoma Park, Mary h groups of patients analyzed 
land) or for a fee from the American Documenta 
tion Institute, Order Document No. 6645 from ADI REFERENCE 
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Tatom, Mary H. A factorial isolation of psychiatric 
outpatient syndromes. J. consult. Psychol., 1958 
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rABLE 1 
PROFILES OF FacToRS IN REFERENCE INDIVIDUALS 
Factor 

Reference 

Individual \ \ B B ( e D D 
O-5 (A 60 66 20 OO 02 Os 26 13 
0-3 (A 67 87 03 .06 OO 05 05 - 07 
\-1 20 27 49 71 11 28 01 18 
\-4 (B 00 02 57 =A 01 01 O4 14 
H-3 (C 02 ()2 01 06 69 71 03 00 
A-2(C O8 39 4 OC O1 58 10) 26 51 
H-2 (D 26 64 11 38 02 39 19 37 
H-4 (D 13 39 0O 52 04 42 49 3 
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IDENTIFICATION IN TERMS OF PERSONAL CONSTRUCTS: 
RECONCILING A PARADOX IN THEORY ! 


ROBERT E. JONES 
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In this study, identification is defined as per- 
ceived similarity of self and others, experienced 
in terms of personal constructs. The author ac- 
cepts the definition of phenomenological psy- 
chologists and the thinking of psychoanalysts, 
such as R. P. Knight, who emphasize identifica- 
tion as a relationship rather than a process, and 
as a perceived rather than as an actual similarity. 

The psychology of personal constructs, the 
theoretical system developed by G. A. Kelly, is 
a perceptual approach to the prediction and ex- 
planation of human behavior. One’s personal con- 
structs are the vehicles, verbally expressed, by 
which one anticipates the behavior of others and 
guides his own behavior. Constructs are dimen- 
sions defined by terms which the perceiver ac- 
cepts as opposites or contrasts. The way in which 
two persons are seen as alike yet different from a 
third would be a construct. A form of the Role 
Construct Repertory Test is employed to meas- 
ure identification with “significant others.” 

Identification with male figures in the Reper- 
tory test was taken as the central measure in the 
present research. Repeat-test reliability was es- 
tablished in the high .80s. The subjects were 36 
hospitalized males with mild or moderate psychi- 
itric disorders, and a control group of 36 normal 
males, matched on age and education. 

The central hypotheses were two: (a) neuro- 
psychiatric (NP) patients more often than nor- 
mal adult males will either overidentify or under- 


' An extended report of this study may be ob 
tained without charge from Robert E. Jones (Psy- 
chology Service, Veterans Administration Hospital; 
Danville, Illinois) or for a fee from the American 
Documentation Institute. Order Document No. 6646 
from ADI Auxiliary Publications Project, Photo- 
duplication Service, Library of Congress; Washing- 
ton 25, D. C., remitting in advance $1.75 for micro- 
film or $2.50 for photocopies. Make checks payable 
to: Chief, Photoduplication Service, Library of Con- 
gress. 

This paper draws on data collected when the au- 
thor was engaged in doctoral research at Ohio State 
University. 


276 


identify with personally significant male figures; 
and (b) the personal construct matrices of NP 
patients will be simpler than those of controls. 

The hypothesis of underidentification is sup- 
ported only at the 10% level, by one-tailed ¢ test 
The hypothesis of overidentification is significant 
at the 1% level: NPs are more likely to see 
others as extremely like the self than are nor- 
mals. Also as predicted, the idiographic factors 
required to “explain” the interpersonal matrix 
are significantly fewer (at the .05 level) for NPs 
than for the controls. For NPs, but not for nor- 
mals, the more simple the factor matrix the more 
fully it is explained by a value construct (.01 chi 
square significance). 

The major contribution of the Personal Con- 
struct approach to identification theory lies in the 
reconciliation of previous theories. Both E. H. 
Erikson and O. H. Mowrer are partly right. Both 
over- and underidentification are common badges 
of maladjustment. Both are associated with a 
common defect—a factorially simple, value-laden 
system of constructs. Dimensions of perception 
permitting useful discrimination become inoper- 
ant. With the construct system compelling polari- 
zation into “good guys and bad,” we see, with 
Sanford, how “desperation” promotes classical 
identification of the all-or-none variety. Identifi- 
cation with a hated object tends to be uncon- 
scious and accomplished by introjection. Identifi- 
cation with an idealized object tends to be con- 
scious and achieved by assimilative projection 
Sanford’s “identification proper” is the uncon- 
scious type, always desperation motivated. Either 
type of identification, whether excessive or defi- 
cient, can be explained in terms of an oversimpli- 
fied construct system, preempted by the value 
dimension. 

The findings support Rinder and Campbell’s 
contention that both over- and underidentifica- 
tion reflect the same neurotic dynamic: in their 
terms, “undue reaction-sensitivity”; in ours, un- 
due perceptual restriction to the value dimension 


(Received September 22, 1960) 
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pretation is based on careful factor- 
analytic studies? Which has norms for 
ll-year-old through adult levels (over 
1900 selected Ss)? Uses an interesting 
and disarming cartoon-like format? 


Cassel and Kahn 


GROUP 
PERSONALITY 
PROJECTIVE 
TEST 


Order Examiner’s Set including gen- 
eral manual, scoring keys, 10 test book- 
lets, and 100 answer sheets, enough ma- 
terial for a gond-sized testing program 
(and only $13.50). 


o 
For further information, write to 


PSYCHOLOGICAL TEST 
SPECIALISTS. 
Box 1441 
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outstanding therapeutic, educational, and voco- 
tionai services are available. 


Serving the East Coast, Devereux Schools are located at Devon, Pennsylvania (Mr. 
Charles J. Fowler, Director of Admissions); serving the West Coast, Devereux Scheols at 
Sento Barbera, California (Mr. Keith A. Seaton, Registrar); and serving the Southwest, 
Devereux Schowls at Victoria, Texas (Mr. John M. Barclay, Director of Development). 
These will welcome your inquiry. 
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